CSCI 544 Applied NLP: Proposing a Group Project

Due Date: October 14 by 11:59pm, early submissions welcomed!

For your group project, you will need to form a group of three to four students, and pick an NLP-related task for your project. Your group needs to start by submitting a proposal by filling out the form here (only ONCE per group):

Online proposal submission.

The form asks you to describe:

You will need to list each group member and what tasks they will perform. Each project member will write an individual report on the project, and receive an individual grade. In addition, the group is required to give an in-class presentation. Currently, we have allocated the last three lectures for the presentations, but may need to include additional lectures depending on the number of groups. After you submit your project proposal, your group will receive an email from either Mark or Kallirroi. The email will either approve the proposal, or give comments and ask for revisions.

Question: Can I form a group with students from the other section?
Answer: Yes, assuming the entire group can attend either the 4pm or 6pm lecture during the day their group is presenting.

Help in picking a topic

Domain adaptation

Domain adaptation is a topic that can be explored in a variety of NLP tasks. For many of these tasks, there are large corpora and software tools trained on these corpora. However, performance of the tools may drop when used in domains different from the training corpus. The idea of domain adaptation is to annotate a small amount of training data in the new domain and create a model based on both the large out-of-domain corpus and the small in-domain corpus. This can be applied to NLP tasks such as part of speech tagging, named entity recognition and parsing.

Speech recognition / speech synthesis topics

NLU topics

Discourse topics

Dialogue topics

Help in finding data

Useful tools and resources for speech recognition

Useful tools and resources for language modeling

Useful tools and resources for speech synthesis

Useful tools for building dialogue systems