CMSC 28000: Introduction to Formal Languages (Spring 2022)
General information
- Instructor
- Timothy Ng (timng@uchicago.edu)
Office hours: MWF 10:30–11:30, JCL 208
- Teaching assistants
- See Ed.
- Class meeting
- MWF 9:30–10:30, Pick Hall 016
- Links
-
Overview
This course explores the mathematical foundations of computation. How do we quantify computational power? Are there limits on the kinds of problems that computers can solve? To answer such questions, we examine the curious connection between computation and mathematical linguistics.
Communication
There are a number of different ways we'll be communicating about the class.
- Course materials
- Lecture notes will be made available via this course website. The course website also contains basic information about the course (i.e. you can treat this like a syllabus).
- Discussion and announcements
- We will use Ed Discussion for course discussion and announcements. Restricted course materials will also be posted here.
- Coursework and grades
- We will use Gradescope for distributing and receiving coursework. Your grades will also be available here.
- Office hours
- Office hours are times when the course staff are available for you. The instructor and teaching assistants will have scheduled office hours in-person and/or online. While most students use this as an opportunity to ask about coursework, you're free to ask about or discuss things that are related directly or indirectly with the course.
Course materials and objectives
The following is a list of topics that will be covered in the course.
- Regular languages
- Deterministic and non-deterministic finite automata, closure properties, regular expressions, the pumping lemma for regular languages, DFA minimization, the Myhill-Nerode theorem, derivatives.
- Context-free languages
- Context-free grammars, normal forms, the pumping lemma for context-free languages, pushdown automata, closure properties.
- Decidable and undecidable languages
- Turing machines, the Church-Turing thesis, computability, undecidability, the Halting Problem, reductions.
Upon completion of the course, students should be able to
- Describe how computation can be modeled mathematically.
- Describe how computational problems can be described as formal languages.
- Classify formal languages according to their complexity.
- Describe and transform formal languages across their different representations.
- Describe the limits of various computational models.
Text
There is no required textbook for this course. Lecture notes will be made available. Lectures are based largely on the following sources—you can consider one of these if you're looking for a reference.
- Dexter Kozen. Automata and Computability (1997). This is available online for free via the UChicago Library.
- Michael Sipser. Introduction to the Theory of Computation, 3rd ed. (2012). Any edition of this would be a solid reference.
The lectures also incorporate material from the following, to a lesser extent.
- John Hopcroft and Jeffrey Ullman. Introduction to Automata Theory, Languages, and Computation, 1st ed. (1979).
- Jeffrey Shallit. A Second Course in Formal Languages and Automata Theory (2008).
- Jacques Sakarovitch. Elements of Automata Theory (2009).
If you refer to one of these texts, keep in mind that some notation and definitions will vary. In such cases, the course notes will take precedence.
Evaluation
Your computed grade in this course will be determined by the following coursework components.
- Weekly problem sets, worth 40% in total. Let $P$ be the average of the grades earned on problem sets.
- A midterm exam, worth at most 20%. Let $M$ be the grade earned on the midterm exam.
- A final exam, worth at least 40%. Let $F$ be the grade earned on the final exam.
- Biweekly personal reflections each worth 1%. Let $R$ be the number of completed reflections.
Let $S = (60\% \times P) + (1\% \times R)$. Then your computed grade will be obtained via the following formula:
\[(40\% \times P) + (20\% \times M) + (F \times (60\% - (20\% \times M)) + (1\% \times R).\]
Problem sets
Problem sets will be released on Wednesdays and will be due on the following Wednesday at 9:00 pm (21:00) Central.
Problem sets will be distributed and submitted via Gradescope. Please ensure that submissions are legible. See this guide for submitting on Gradescope.
Collaboration and citation
Students are expected to write up solutions to problem sets individually, but may work together. The work that you hand in is expected to be your own. Be sure to acknowledge your collaborators and any sources beyond course materials that you may have used.
- If you choose to work with others, you must list your collaborators. You may only work with other students in the class. Your collaboration should be to a degree that your writeups are substantially different. It is not acceptable that solutions are largely copies of each other.
- You should not look up solutions to assigned problems, online or otherwise. This includes question and answer forums, homework websites, and other publicly available materials from other courses in the department or at other institutions. Be aware that if you can find a solution online, the course staff can too.
- If you use a source beyond course materials, the source must be a published source—either a book or journal article—and full citation must be given. If you are not sure whether your source is a published source, it probably isn't.
How solutions are graded
Your solutions on problem sets and exams are judged on the following basis:
- Validity. Your solution will be judged on its correctness. This includes whether a method was applied correctly, calculations are correct, and proof steps follow the rules of logic.
- Presentation. Your solution will be judged on its presentation as it relates to readability and fluency. A solution may be correct, but is unable to communicate this fact to the reader. A readable solution should guide the reader through the argument, providing the appropriate explanation and justification. By fluency, we mean the correct use of the language of mathematics. This includes the proper use of established terminology and notation.
Grading for problem sets is based on the overall quality of the solution. Clearly, solutions need to be valid in order to be evaluated highly, but a completely valid solution that is not readable will still be evaluated poorly. Furthermore, grades are assigned as a qualitative evaluation of the work and not a quantitative accounting of the work. This appears as a 4-point scale on Gradescope.
- 4 (Excellent). The solution is correct and readable. The solution may have a few trivial flaws in logic or presentation that can be easily corrected.
- 3 (Good). The solution is generally correct and readable and has a some minor flaws in logic or presentation. Understanding of the problem has been demonstrated. The solution and presentation can be improved quickly with a few suggestions.
- 2 (Borderline). The solution could be correct and readable but has a major flaw in logic or presentation. There is evidence that there is some understanding of the problem. The solution and presentation can be improved with some substantial revisions.
- 1 (Needs revision). It is not clear that the problem was understood and/or the presentation of the solution is heavily flawed. The solution should be revised with some guidance.
- 0 (Incomplete). The solution was not submitted or is clearly incomplete.
The assigned grade is the grader’s judgement of whether the solution meets the standards of the class. In addition to the assigned grade, the grader is expected to provide detailed feedback, addressing specific flaws in the submitted work that can and should be improved in future work.
Resubmission
Part of the learning process is identifying and correcting mistakes. After your submissions have been graded and returned to you, you will have the opportunity to use the feedback you receive to revise and resubmit your work.
Please follow the instructions for resubmission carefully. An assignment on Gradescope will be available for resubmissions of the problem set. There are a lot of different assignments on Gradescope, so please read carefully and make sure you’re submitting to the correct one. To prepare your resubmission, you should include:
- the first page of the graded copy of the submission, which contains a summary of the grading (described below) and indicating clearly which solutions are being resubmitted,
- your revised and original solutions, clearly indicated, for problems you wish to resubmit,
- a response to the grader for each problem you wish to resubmit, indicating explicitly how you have implemented the grader’s feedback.
To get a copy of your graded problem set, use the Download Graded Copy button. The first page of this document is an outline of the grading. Include this page in your resubmission and use it to refer to the graders’ feedback in your response. If you have questions about a grader’s comments, you should contact the course staff by making a post visible only to course staff on Ed.
- Only complete solutions may be resubmitted. The point of this process is to allow you to incorporate the feedback you receive from grading. Incomplete solutions will have no actionable feedback, defeating the purpose of resubmission.
- You do not need to resubmit the entire problem set if you only want to resubmit a solution for one problem.
- You do not need to resubmit any part of the problem set if you are satisfied with your grade.
- If you resubmit part of the problem set, for all problems you are not resubmitting, please select the grading summary page for that problem.
- Your resubmissions will be graded by the same grader.
Regrade requests
You may submit a regrade request in the event of an error by the grader. That is, if the feedback provided by the grader is a factual error, you may request a review of the grading. Please indicate the source of the error in this case.
We will not consider regrade requests concerning disagreement with a grader’s evaluation of your work. In such cases, you should consider the feedback that was given and apply it towards revision and resubmission of your work.
Lectures
- March 28
- Introduction, Computation, Strings
- March 30
- Languages, finite automata
- April 1
- Divisiblity, closure properties, product automaton
- April 4
- String operations, nondeterminism
- April 6
- The subset construction, $\varepsilon$-NFAs
- April 8
- Regular languages and regular expressions
- April 11
- Finite automata and regular expressions
- April 13
- Derivatives of regular expressions
- April 15
- Non-regular languages and the pumping lemma
- April 18
- The Myhill–Nerode theorem
- April 20
- DFA minimization
- April 22
- Grammars and context-free languages
- April 25
- Properties of grammars
- April 27
- Midterm exam
- April 29
- Chomsky normal form, CYK
- May 2
- Pushdown automata
- May 4
- Equivalence of context-free grammars and pushdown automata
- May 6
- Closure properties of context-free languages
- May 9
- Non-context-free languages and the pumping lemma
- May 11
- Computability and Turing machines
- May 13
- Working with Turing machines
- May 16
- Decidable languages and decision problems on formal languages
- May 18
- Diagonalization and undecidability
- May 20
- Reduction
- May 23
- Rice's theorem
- May 25
- Undecidable problems about context-free languages
Academic integrity
It is your responsibility to be familiar with the University’s policy on academic honesty. Instances of academic dishonesty will be referred to the Office of the Provost for adjudication. Following the guidelines above on collaboration and citation should be sufficient, but if you have any questions, please ask me.
Accessibility
Students with disabilities who have been approved for the use of academic accommodations by Student Disability Services (SDS) and need reasonable accommodation to participate fully in this course should follow the procedures established by SDS for using accommodations. Timely notifications are required in order to ensure that your accommodations can be implemented. Please meet with me to discuss your access needs in this class after you have completed the SDS procedures for requesting accommodations.