CMSC 28000: Introduction to Formal Languages (Spring 2023)
General information
- Instructor
- Timothy Ng (timng@uchicago.edu)
- Class meeting
- MWF 9:30–10:30, HM 140
- Links
-
Overview
This course explores the mathematical foundations of computation. How do we quantify computational power? Are there limits on the kinds of problems that computers can solve? To answer such questions, we examine the curious connection between computation and mathematical linguistics.
Communication
There are a number of different ways we'll be communicating about the class.
- Course materials
- Lecture notes will be made available via this course website. The course website also contains basic information about the course (i.e. you can treat this like a syllabus).
- Discussion and announcements
- We will use Ed Discussion for course discussion and announcements. Restricted course materials will also be posted here.
- Coursework and grades
- We will use Gradescope for distributing and receiving coursework. Your grades will also be available here.
- Office hours
- Office hours are times when the course staff are available for you. The instructor and teaching assistants will have scheduled office hours in-person and/or online. While most students use this as an opportunity to ask about coursework, you're free to ask about or discuss things that are related directly or indirectly with the course.
Course materials and objectives
The following is a list of topics that will be covered in the course.
- Regular languages
- Deterministic and non-deterministic finite automata, closure properties, regular expressions, Brzozowski derivatives, the pumping lemma for regular languages, DFA minimization, the Myhill-Nerode theorem.
- Context-free languages
- Context-free grammars, normal forms, the pumping lemma for context-free languages, pushdown automata, closure properties.
- Decidable and undecidable languages
- Turing machines, the Church-Turing thesis, computability, undecidability, the Halting Problem, reductions.
Upon completion of the course, students should be able to
- Describe how computation can be modeled mathematically.
- Describe how computational problems can be described as formal languages.
- Classify formal languages according to their complexity.
- Describe and transform formal languages across their different representations.
- Describe the limits of various computational models.
Text
There is no required textbook for this course. Lecture notes will be made available. Lectures are based largely on the following sources—you can consider one of these if you're looking for a reference.
- Dexter Kozen. Automata and Computability (1997). This is available online for free via the UChicago Library.
- Michael Sipser. Introduction to the Theory of Computation, 3rd ed. (2012). Any edition of this would be a solid reference.
The lectures also incorporate material from the following texts, to a lesser extent.
- John Hopcroft and Jeffrey Ullman. Introduction to Automata Theory, Languages, and Computation, 1st ed. (1979).
- Jeffrey Shallit. A Second Course in Formal Languages and Automata Theory (2008).
- Jacques Sakarovitch. Elements of Automata Theory (2009). This is also available online for free through the UChicago library.
If you refer to one of these texts, keep in mind that some notation and definitions will vary. In such cases, the course notes will take precedence.
Evaluation
Your computed grade in this course will be determined by the following coursework components.
- Weekly problem sets, worth 50% in total.
- A project, worth 10%.
- A final examination, worth 40%.
Problem sets
Problem sets will be released on Wednesdays and will be due on the following Thursday at 8:00 pm (20:00) Central.
Problem sets will be distributed and submitted via Gradescope.
- Please ensure that submissions are legible if handwritten! This includes making sure that your writing is neat as well as ensuring that your photos or scans are of reasonable quality.
- Take the opportunity to learn how to typeset mathematis with $\LaTeX$! In addition to many resources you can find online, all of the $\LaTeX$ commands used in the lecture notes is easily accessible by right-clicking on any formula, and selecting "Show Math As" $\rightarrow$ "TeX Commands$ from the context menu.
Collaboration and citation
The purpose of problem sets is to give you the opportunity to practice working through the process of solving problems and to receive feedback on that process. The work that you hand in to be graded is expected to be your own so that the feedback you receive will be the most useful. Be sure to acknowledge your collaborators and any sources beyond course materials that you may have used.
- If you choose to work with others, you must list your collaborators. You may only work with other students in the class. Your collaboration should be to a degree that your writeups are substantially different. It is not acceptable that solutions are largely copies of each other.
- You should not look up solutions to assigned problems, online or otherwise. This includes question and answer forums, homework websites, and other publicly available materials from other courses in the department or at other institutions. Be aware that if you can find a solution online, the course staff can too.
- If you use a source beyond course materials, the source must be a published source—either a book or journal article—and full citation must be given. If you are not sure whether your source is a published source, it probably isn't.
How solutions are graded
Your solutions on problem sets and exams are judged on the following basis:
- Validity. Your solution will be judged on its correctness. This includes whether a method was applied correctly, calculations are correct, and proof steps follow the rules of logic.
- Presentation. Your solution will be judged on its presentation as it relates to readability and fluency. A solution may be correct, but is unable to communicate this fact to the reader. A readable solution should guide the reader through the argument, providing the appropriate explanation and justification. By fluency, we mean the correct use of the language of mathematics. This includes the proper use of established terminology and notation.
Grading of problems is based on the overall quality of the solution. Clearly, solutions need to be valid in order to be evaluated highly, but a completely valid solution that is not readable will still be evaluated poorly. Furthermore, grades are assigned as a qualitative evaluation of the work and not a quantitative accounting of the work. This appears as a 4-point scale on Gradescope.
Generally, the overarching principle that guides the grading is: How much work and guidance is needed for you to revise your submission into something that is Excellent?
- 4 (Excellent). The solution is correct and readable. The solution is not necessarily perfect—it may have a few trivial flaws in logic or presentation that can be easily corrected.
- 3 (Good). The solution is generally correct and readable and has some minor flaws in logic or presentation—often there is one last "step" that is missing. Understanding of the problem has been clearly demonstrated. The solution and presentation can be improved quickly with a few suggestions—for instance, a bit more generalization or one more logical step that needs to be taken.
- 2 (Borderline). The solution could be correct and readable but has a major flaw in logic or presentation. There is evidence that there is some understanding of the problem and the solution is on the right track or has the right idea. The solution and presentation can be improved with some guidance and substantial revisions.
- 1 (Needs revision). The solution is generally not correct or not readable. It is not clear that the problem was understood and/or the presentation of the solution is heavily flawed. In either case, there is a significant gap in understanding or execution. The solution should be revised with some guidance.
- 0 (Incomplete). The solution was not submitted or is clearly incomplete.
The assigned grade is the grader’s judgement of whether the solution meets the standards of the class. In addition to the assigned grade, the grader is expected to provide detailed feedback, addressing specific flaws in the submitted work that can and should be improved in future work.
Resubmission
Part of the learning process is identifying and correcting mistakes. After your submissions have been graded and returned to you, you will have the opportunity to use the feedback you receive to revise and resubmit your work.
Regrade requests
You may submit a regrade request in the event of an error by the grader. That is, if the feedback provided by the grader is a factual error, you may request a review of the grading. Please indicate the source of the error in this case.
We will not consider regrade requests concerning disagreement with a grader’s evaluation of your work. In such cases, you should consider the feedback that was given and apply it towards revision and resubmission of your work.
Lectures
Section numbers refer to Kozen.
- March 20
- The theory of computation, strings and words [1,2]
- March 22
- Languages, finite automata [2,3]
- March 24
- Finite automata, divisibility [3,4]
- March 27
- Closure properties, the product automaton [4]
- March 29
- Nondeterministic finite automata [5]
- March 31
- The subset construction, $\varepsilon$-transitions [6]
- April 3
- Regular languages and regular expressions [7,8]
- April 5
- Finite automata and regular expressions [9]
- April 7
- Derivatives of regular expressions [Sakarovitch 4.4]
- April 10
- Non-regular languages [11,12]
- April 12
- The Myhill–Nerode theorem [15,16]
- April 14
- DFA minimization [13,14]
- April 17
- Grammars and context-free languages [19]
- April 19
- Properties of context-free grammars [19,21]
- April 21
- Chomsky normal form, parsing [21,27]
- April 24
- Pushdown automata [23]
- April 26
- Equivalence of of context-free grammars and pushdown automata [E,24]
- April 28
- Closure properties of context-free languages [25,27]
- May 1
- Non-context-free languages [22]
- May 3
- Computability and Turing machines [28]
- May 5
- Working with Turing machines [29,30]
- May 8
- Decision problems on formal languages [Sipser 4.1]
- May 10
- Universal machines and diagonalization [31]
- May 12
- Undecidable problems and Rice's theorem [32,34]
- May 15
- Decision problems on context-free languages [35]
- May 17
- Post's correspondence problem and reduction [33]
- May 19
- Q&A
Academic integrity
It is your responsibility to be familiar with the University’s policy on academic honesty. Instances of academic dishonesty will be referred to the Office of the Provost for adjudication. Following the guidelines above on collaboration and citation should be sufficient, but if you have any questions, please ask me.
Accessibility
Students with disabilities who have been approved for the use of academic accommodations by Student Disability Services (SDS) and need reasonable accommodation to participate fully in this course should follow the procedures established by SDS for using accommodations. Timely notifications are required in order to ensure that your accommodations can be implemented. Please meet with me to discuss your access needs in this class after you have completed the SDS procedures for requesting accommodations.