EECS595: Natural Language Processing
Homework 4, Fall 2023
Due 10/30/2023
Student Name: xxx — uniqname: xxx
Submission Guidelines
1. Please insert your student information in line 63 of this LaTeX file;
2. Please insert your answers between each pair of \begin{solution} and \end{solution};
3. Zip the files and submit to Canvas. Checklist: hw4.pdf.
Problem 1: Probabilistic Context Free Grammar
Your friend decides to build a Treebank. He finally produces a corpus which contains the following three parse trees:
Tree 1: (S (NP John)
           (VP (V1 said)
               (SBAR (COMP that)
                     (S (NP Sally)
                        (VP (VP (V2 snored)) (ADVP loudly))))))

Tree 2: (S (NP Sally)
           (VP (V1 declared)
               (SBAR (COMP that)
                     (S (NP Bill)
                        (VP (VP (V2 ran)) (ADVP quickly))))))

Tree 3: (S (NP Fred)
           (VP (V1 pronounced)
               (SBAR (COMP that)
                     (S (NP Jeff)
                        (VP (VP (V2 swam)) (ADVP elegantly))))))
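For reference, a PCFG is read off a treebank by relative-frequency estimation: each rule A -> beta receives probability count(A -> beta) / count(A), and a parse tree's probability is the product of the probabilities of the rules it uses. The sketch below illustrates this; it assumes NLTK is installed, the bracketed tree strings are simply the three trees above, and the helper names (TREEBANK, tree_probability) are illustrative rather than part of the required written answers.

\begin{verbatim}
# Minimal sketch: estimate PCFG rule probabilities from bracketed trees
# by relative-frequency counting (assumes NLTK is installed).
from collections import Counter

from nltk import Tree
from nltk.grammar import Nonterminal, induce_pcfg

TREEBANK = [
    "(S (NP John) (VP (V1 said) (SBAR (COMP that) "
    "(S (NP Sally) (VP (VP (V2 snored)) (ADVP loudly))))))",
    "(S (NP Sally) (VP (V1 declared) (SBAR (COMP that) "
    "(S (NP Bill) (VP (VP (V2 ran)) (ADVP quickly))))))",
    "(S (NP Fred) (VP (V1 pronounced) (SBAR (COMP that) "
    "(S (NP Jeff) (VP (VP (V2 swam)) (ADVP elegantly))))))",
]

trees = [Tree.fromstring(t) for t in TREEBANK]

# Every rule occurrence, including lexical rules such as NP -> 'John'.
productions = [p for t in trees for p in t.productions()]

# Maximum-likelihood estimates: count(A -> beta) / count(A).
rule_counts = Counter(productions)
lhs_counts = Counter(p.lhs() for p in productions)
rule_prob = {p: c / lhs_counts[p.lhs()] for p, c in rule_counts.items()}

# NLTK can also induce the same grammar directly.
grammar = induce_pcfg(Nonterminal("S"), productions)
print(grammar)


def tree_probability(tree):
    """A tree's probability is the product of its rule probabilities."""
    prob = 1.0
    for p in tree.productions():
        prob *= rule_prob[p]
    return prob


print(tree_probability(trees[0]))
\end{verbatim}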
You then purchase the Treebank and decide to build a PCFG, and a parser, using your friend’s data. Now answer the following three questions:
1. (Written) Show the PCFG that you would derive from this Treebank.
Solution:
2. (Written) Show two parse trees for the string “Jeff pronounced that Fred snored loudly”, and calculate their probabilities under the PCFG.
Solution:
3. (Written) You are surprised that “Jeff pronounced that Fred snored loudly” has two possible parses, and that one of them, the parse in which Jeff is doing the pronouncing loudly, has relatively high probability. This type of high attachment is never seen in the corpus, so the PCFG is clearly missing something. You decide to fix the Treebank by altering some non-terminal labels in the corpus. Show one such transformation that results in a PCFG giving zero probability to parse trees with high attachments. (Your solution should systematically refine some non-terminals in the Treebank, in a way that slightly increases the number of non-terminals in the grammar but allows the grammar to capture the distinction between high and low attachment to VPs.)
Solution:
Problem 2: Dependency Parsing
This exercise is to get you familiar with dependency parsing and the Stanford CoreNLP [1] toolkit. You may also need to consult the inventory of universal dependency relations. You have two options to complete this exercise.
• Install the toolkit. Please check Stanza and follow the instructions to install the toolkit. You may need to use the toolkit for your final project.
• Run the demo system. You can also use the demo system without installing the toolkit.
You should experiment with different sentences and paragraphs to get a feel for how the parser works. In particular, you need to run the parser on the following paragraph and answer some questions (a minimal Stanza invocation is sketched after the paragraph).
The unveiling event for the innovative ChatGPT was shared online yesterday. This event, powered by the potent GPT-4, was projected for next month but was expedited after AI enthusiasts showed an enormous interest. All individuals now have the chance to explore its advanced capabilities. The AI community, though already familiar with preceding models, is buzzing with discussions and analyses. OpenAI confirmed that GPT-3.5/GPT-4 was the driving force behind ChatGPT, leading to its accelerated launch and widespread acclaim.
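If you choose the Stanza option, a minimal sketch along these lines (assuming the toolkit and its English models are installed; the variable text is a stand-in for the paragraph above, and nothing here is required for the submission) prints each word with its dependency relation and head, which is enough to spot questionable edges.

\begin{verbatim}
# Minimal Stanza dependency-parsing sketch (assumes `pip install stanza`).
import stanza

# stanza.download("en")  # uncomment on first use to fetch the English models
nlp = stanza.Pipeline(lang="en")  # default English pipeline, includes depparse

text = (
    "The unveiling event for the innovative ChatGPT was shared online "
    "yesterday."  # replace with the full paragraph given above
)

doc = nlp(text)
for sentence in doc.sentences:
    for word in sentence.words:
        # word.head is a 1-based index into the sentence; 0 marks the root.
        head = sentence.words[word.head - 1].text if word.head > 0 else "ROOT"
        print(f"{word.text}\t--{word.deprel}-->\t{head}")
\end{verbatim}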
Please answer the following questions:
1. (Written) Give three examples where the parsed results are incorrect.
Solution:
2. (Written) What would be the correct relation for each of the examples you identified above? Consult the Universal Dependencies documentation of relations to answer this question.
Solution:
3. (Written) What is your general impression of the parsed results? Does the length of the sentence affect the parser's performance?
Solution:
References
[1] Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J. R., Bethard, S., & McClosky, D. (2014, June). The Stanford CoreNLP natural language processing toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp. 55-60).