Fall 2025

CSCI 544 · Applied NLP

Logistics

Time:
Tuesday/Thursday, 4:00–5:50 p.m.
Location:
SAL 101
Course site:
Brightspace ↗
Discussion:
Ed ↗
Mailing list:
csci544-25f@googlegroups.com (subject: CSCI544)

Teaching Team

Jieyu Zhao

Instructor: Jieyu Zhao

Office Hour: 30 mins after the class

Teaching Assistants

  • Rebecca Dorn
    Rebecca Dorn — Thursdays 9:30–10:30 AM
  • Thomas Reeves
    Thomas Reeves — Wednesdays 8:30–9:30 AM
  • Sahana Ramnath
    Sahana Ramnath — Mondays 11:00 AM–12:00 noon
  • Pengda Xiang
    Pengda Xiang — Tuesday 10:00–11:00 AM
  • Muzi Tao
    Muzi Tao — Tuesday 3:00–4:00 PM
  • Ziyi Liu
    Ziyi Liu — Friday 2:00–3:00 PM
  • Xu Wang
    Xu Wang — Wednesday 11:00 AM–12:00 PM

Introduction

This course covers both fundamental and cutting-edge topics in Natural Language Processing (NLP) with a focus on Language Models. NLP has been revolutionized by the advancement of large-scale language models, achieving state-of-the-art performance across a wide variety of tasks. This course will cover the fundamentals of language modeling and related topics in NLP, deep learning, and machine learning.

Students will gain familiarity with the capabilities of large language models as well as hands-on experience with building and evaluating small-scale language models. The class will also explore the real-world consequences of deploying language models, such as the ethics and harms associated with them.

News

Syllabus

Calendar subject to change. All work (except final report) is due 11:59 PM PT on the listed date.

Wk Date Topic Readings Due
1 Aug 26 Introduction to LMs and Course Overview
Aug 28 n-gram Models
2 Sep 2 n-gram LMs (Smoothing) + Logistic Regression
Sep 4 Logistic Regression (contd.)
3 Sep 9 Word Embeddings
Group Formation
Sep 11 Word Embeddings (contd.)
4 Sep 16 Feedforward Neural Nets
Sep 18 Backpropagation
HW1 Release
5 Sep 23 Recurrent Neural Networks
Quiz 1
Sep 25 Seq2Seq and Attention
Project Proposal
6 Sep 30 Transformers — Building Blocks
Oct 2 Transformers (contd.)
Mid-Semester Eval
7 Oct 7 Guest Lecture — PyTorch for Transformers (TA)
Oct 9 Fall Recess
8 Oct 14 Midterm Exam
Midterm ExamHW1 Due
Oct 16 Pre-training and Finetuning Transformers
HW2 Release
9 Oct 21 Tokenization and Generating from LMs
Oct 23 Language Generation
10 Oct 28 LLMs — Evaluation
Project Status Report
Oct 30 LLMs — In-context Learning, Scaling Law
11 Nov 4 LLMs — Post-Training (TA Xu Wang)
Nov 6 Guest Lecture on LLM Agents (Qingyun Wu, hosted by Xu Wang)
12 Nov 11 Veterans Day Holiday
Nov 13 Guest Lecture on LLMs (Swabha Swayamdipta)
13 Nov 18 Guest Lecture: LLM + Medical (Ruishan Liu)
Quiz 2HW2 Due
Nov 20
Project Presentations I
14 Nov 25
Project Presentations II
Nov 27 Thanksgiving
15 Dec 2
Project Presentations III
Dec 4
Project Presentations IV
16 Dec 9
Dec 11
17 Dec 16
Final Report (6:30 PM)

Contact

Students should ask all course-related questions in the Ed forum, where you’ll also find announcements. The course Ed page is linked from the Brightspace page. TA office-hour Zoom links are available on Brightspace/Ed. For external enquiries, emergencies, or personal matters you don’t wish to put in a private Ed post, email csci544-25f@googlegroups.com with CSCI544 in your subject line. Please send all emails to this mailing list — do not email the instructors directly. We try to respond within 48–72 hours.

Assignments & Grading

  • Homeworks (20%) — 10% × 2: Two coding homework assignments based on class topics.
  • Quizzes (10%) — 5% × 2: Multiple-choice and short-answer questions. Missed quizzes receive a zero; no make-up quizzes.
  • Class Projects (50%) — Each student does a group project based on class topics. Students propose their own project, do the research and build a proof-of-concept, create a video demonstration, and present the project in their report.
    • Proposal: 5%
    • Status Reports: 10%
    • Project Presentation: 15%
    • Final Write-up: 20%
  • Exams (20%) — Midterm (20%): Mixture of multiple-choice and long-form questions, covering the first half of the material.

Grading inquiries can be asked (to the TAs) within two weeks from the grading date. Grades will be available within 2–2.5 weeks after submission.

All written assignments related to the final project should use the standard *ACL paper submission template.

Project Deliverables

Project proposal (5%)

Student teams should submit a ~1-page proposal (using the *CL paper submission template) for their project. The proposal should:

  • state and motivate the problem by providing a problem or task definition (preferably with example inputs and expected outputs),
  • situate the problem within related work (this might help you find sources of data for training a model for your task),
  • include related work / publications — start by looking in the ACL anthology,
  • references do not count towards page limit, but please follow the correct format,
  • state a hypothesis to be verified and how to verify it (evaluation framework), and
  • provide a brief description of the approach to be followed to verify the hypothesis (such as proposed models and baselines).

We highly encourage students to work towards a problem involving predictive models. It’s worth thinking about the five key ingredients of supervised learning: data, model, loss function, optimization algorithm, and inference / evaluation.

Project progress report (10%)

Student teams should submit a ~3-page progress report (using the *CL paper submission template). This report should:

  • once again describe the project’s goals (it’s okay if this has changed slightly since the proposal, based on feedback),
  • contain all details on the dataset (your dataset should mostly be collected by this time),
  • contain some initial results (motivating results), and
  • outline a concrete plan of what will be done before the final report.

While the initial results might be inconclusive, you are expected to have made non-trivial progress by this point. The project proposal may be extended for this report. Please take into consideration the earlier feedback you received, and address those inline.

Project final presentation (15%)

Each team will prepare a 5-minute presentation, followed by 1-2 minutes of Q&A. You can choose a representative for your team. Use one slide to clearly describe which team member is responsible for which part. Points will be deducted if the time limit is violated, so please practice timing your talk.

Each presentation should describe:

  • the underlying motivation of the project,
  • the research questions answered in the project,
  • the proposed methods,
  • the findings so far,
  • the contribution of each member, and
  • audience questions answered.

If you are in the audience, you can earn bonus points by asking insightful questions (and clearly announcing your name before asking).

Each team will prepare slides (via Google Slides) and add the link in the course presentation spreadsheet shared through Brightspace/Ed by 11:59 PM the day before the presentation. Failure to share slides on time will cause a loss of grade.

Project final report (20%)

Student teams should submit a ~4-6 page final report (using the *CL paper submission template) detailing all aspects of their project. The report should be structured like a conference paper, including:

  • an abstract,
  • an introduction to the problem and method,
  • related work, highlighting similarities and differences to your own work,
  • a description of the method used,
  • the experiments and results, and
  • a discussion of the results, outlining future work possibilities.

A tech report format is discouraged. Parts of the proposal and progress report may be reused for the final report. Negative results will not be penalized, but should be accompanied with detailed analysis of why the proposed method did not work as anticipated. You may include an appendix at the end. References and the appendix do not count towards the main report page limit. You must submit all your code as a final deliverable (zip file). Plagiarism will be strictly penalized.

Late Days

Students are allowed a maximum of 4 late days total for all assignments (but NOT the quizzes). You may use up to 2 late days per assignment. Using one late day for a project assignment involves each teammate using a late day. Partial late days are not permitted. For every extra late day beyond the allowed limit, the student/team will lose 20% of the grade for that assignment.

Please familiarize yourself with the academic policies and read the note about student well-being.

Reference Texts

These texts are useful but none are required. All are free to read online.