Large Language Models

Large Language Models#

Course Description#

This advanced course imparts a comprehensive understanding of Large Language Models (LLMs) with a focus on their architecture, application paradigms, and ethical implications. Structured over 15 weeks, the course is tailored for students with a background in machine learning and natural language processing. It features hands-on training, in-depth analysis of scholarly papers, a midterm examination, and a final project centering on the development of a practical LLM application.

Learning Goals#

Upon completing this course, students will be able to:

Understand the architectural intricacies of leading LLMs like BERT, T5, and GPT-3.
Utilize specialized techniques such as few-shot learning, prompt engineering, and in-context learning in LLMs.
Investigate and address ethical concerns including bias and data privacy.
Implement a real-world LLM application as part of the final project.
Critically evaluate peer projects through a formal review process.

Grading#

Participation: 10%
Midterm Exam: 25%
Peer Reviews of Final Project: 5%
Final Project: 60%

Final Project#

The final project mandates students to create a real-world application using a large language model. The project involves data pre-processing, model training/fine-tuning, evaluation, and documentation. Students will also participate in peer reviews to critically evaluate the projects of their peers. The deliverables include a functional LLM application and a research paper.