---
title: Training and evaluating \newline dependency parsers
subtitle: (added to the course by popular demand)
author: Arianna Masciolini
theme: lucid
logo: gu.png
date: VT25
institute: LT2214 Computational Syntax
---

Today's topic

\bigskip \bigskip

Parsing

A structured prediction task

Sequence \to structure, e.g.

  • natural language sentence \to syntax tree
  • code \to AST
  • argumentative essay \to argumentative structure

Example (argmining)

Språkbanken has better fika than CLASP: every fika, someone bakes. Sure, CLASP has a better coffee machine. On the other hand, there are more important things than coffee. In fact, most people drink tea in the afternoon.

Example (argmining)

[figure: argumentation structure of the example above]

\footnotesize From "A gentle introduction to argumentation mining" (Lindahl et al., 2022)

Syntactic parsing

From sentence to tree

From Jurafsky & Martin. Speech and Language Processing, chapter 18 (January 2024 draft):

Syntactic parsing is the task of assigning a syntactic structure to a sentence

  • the structure is usually a syntax tree
  • two main classes of approaches:
    • constituency parsing (e.g. GF)
    • dependency parsing (e.g. UD)

Example (GF)

MicroLang> i MicroLangEng.gf 
linking ... OK

Languages: MicroLangEng
7 msec
MicroLang> p "the black cat sees us now"
PredVPS (DetCN the_Det (AdjCN (PositA black_A) 
(UseN cat_N))) (AdvVP (ComplV2 see_V2 (UsePron 
we_Pron)) now_Adv)

Example (GF)

PredVPS (
    DetCN 
        the_Det 
        (AdjCN (PositA black_A) (UseN cat_N))
    ) 
    (AdvVP 
        (ComplV2 see_V2 (UsePron we_Pron)) 
        now_Adv
    )

Example (GF)

[figure: graphical rendering of the syntax tree]

Dependency parsing

Example (UD)

\small

1	the	_	DET	_	_	3	det	_	_
2	black	_	ADJ	_	_	3	amod	_	_
3	cat	_	NOUN	_	_	4	nsubj	_	_
4	sees	_	VERB	_	_	0	root	_	_
5	us	_	PRON	_	_	4	obj	_	_
6	now	_	ADV	_	_	4	advmod	_	_
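Reading CoNLL-U (sketch)

CoNLL-U is plain text with ten tab-separated columns per token and a blank line between sentences, so it is easy to process directly. Below is a minimal, illustrative reader in Python (not part of any official UD tooling); the field names follow the UD specification.

```python
# Minimal CoNLL-U reader: every token line has 10 tab-separated fields
FIELDS = ["ID", "FORM", "LEMMA", "UPOS", "XPOS",
          "FEATS", "HEAD", "DEPREL", "DEPS", "MISC"]

def read_conllu(lines):
    """Yield sentences as lists of {field: value} token dicts."""
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("#"):      # sentence-level comment
            continue
        if not line:                  # blank line terminates a sentence
            if sentence:
                yield sentence
            sentence = []
        else:
            sentence.append(dict(zip(FIELDS, line.split("\t"))))
    if sentence:                      # file may not end with a blank line
        yield sentence

# the example sentence from the slide above
example = [
    "1\tthe\t_\tDET\t_\t_\t3\tdet\t_\t_",
    "2\tblack\t_\tADJ\t_\t_\t3\tamod\t_\t_",
    "3\tcat\t_\tNOUN\t_\t_\t4\tnsubj\t_\t_",
    "4\tsees\t_\tVERB\t_\t_\t0\troot\t_\t_",
    "5\tus\t_\tPRON\t_\t_\t4\tobj\t_\t_",
    "6\tnow\t_\tADV\t_\t_\t4\tadvmod\t_\t_",
]
for sent in read_conllu(example):
    print([(t["FORM"], t["HEAD"], t["DEPREL"]) for t in sent])
```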

Two paradigms

  • graph-based algorithms: find the optimal tree from the set of all possible candidate solutions or a subset of it
  • transition-based algorithms: incrementally build a tree by solving a sequence of classification problems

Graph-based approaches

\hat{t} = \underset{t \in T(s)}{argmax}\, score(s,t)
  • t: candidate tree
  • \hat{t}: predicted tree
  • s: input sentence
  • T(s): set of candidate trees for s

Complexity

  • choice of T(s) (upper bound: n^{n-1} candidate trees, where n is the number of words in s)
  • scoring function: in the arc-factored model, the score of a tree is the sum of the scores of its edges, each scored individually by a neural network; this results in O(n^3) complexity (toy sketch on the next slide)
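Arc-factored scoring (sketch)

A toy illustration of the arc-factored model, not a real parser: the arc scores below are random numbers standing in for neural network outputs, and the exhaustive search over head assignments is exactly what practical parsers avoid by using maximum spanning tree algorithms such as Chu-Liu/Edmonds.

```python
import itertools
import random

# "the black cat sees us now"; word ids 1..n, 0 is the artificial root
words = ["the", "black", "cat", "sees", "us", "now"]
n = len(words)

# Toy arc scores: in a real parser each score comes from a neural
# network; random numbers stand in for those here
random.seed(0)
arc_score = {(h, d): random.random()
             for h in range(n + 1) for d in range(1, n + 1) if h != d}

def tree_score(heads):
    """Arc-factored score: the sum of the scores of the tree's edges.
    heads[d-1] is the head of word d."""
    return sum(arc_score[(h, d)] for d, h in enumerate(heads, start=1))

def is_tree(heads):
    """A head assignment is a tree iff it has exactly one root (head 0)
    and no cycles; every head chain then terminates at the root."""
    if heads.count(0) != 1:
        return False
    for d in range(1, n + 1):
        path = set()
        while d != 0:
            if d in path:             # came back around: cycle
                return False
            path.add(d)
            d = heads[d - 1]
    return True

# Brute-force argmax over T(s): hopeless beyond toy sentences, since
# the number of candidate trees grows exponentially with n
best = max((h for h in itertools.product(range(n + 1), repeat=n)
            if is_tree(h)),
           key=tree_score)
print(list(zip(words, best)))  # each word paired with its predicted head
```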

Transition-based approaches

  • trees are built through a sequence of steps, called transitions
  • training requires:
    • a gold-standard treebank (as for graph-based approaches)
    • an oracle, i.e. an algorithm that converts each gold tree into a gold-standard sequence of transitions (sketch on the next slide)
  • much more efficient: O(n)
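Transition-based parsing (sketch)

A compact sketch of one common transition system, arc-standard (SHIFT, LEFT-ARC, RIGHT-ARC), together with a static oracle. The static oracle only works if the gold tree is projective; the slides do not commit to a particular system, so take the conventions below as one concrete choice.

```python
def oracle(gold_heads):
    """Static oracle for the arc-standard transition system.
    gold_heads maps each word id (1..n) to its gold head (0 = root).
    Only works if the gold tree is projective."""
    n = len(gold_heads)
    pending = {i: 0 for i in range(n + 1)}   # dependents not yet attached
    for h in gold_heads.values():
        pending[h] += 1
    stack, buf = [0], list(range(1, n + 1))  # 0 is the artificial root
    transitions, arcs = [], []
    while buf or len(stack) > 1:
        if len(stack) >= 2:
            top, second = stack[-1], stack[-2]
            # LEFT-ARC: top is second's head; second must already have
            # collected all of its own dependents
            if gold_heads.get(second) == top and pending[second] == 0:
                arcs.append((top, second))
                pending[top] -= 1
                del stack[-2]
                transitions.append("LEFT-ARC")
                continue
            # RIGHT-ARC: second is top's head; top must be finished too
            if gold_heads.get(top) == second and pending[top] == 0:
                arcs.append((second, top))
                pending[second] -= 1
                stack.pop()
                transitions.append("RIGHT-ARC")
                continue
        if not buf:
            raise ValueError("stuck: the gold tree is non-projective")
        stack.append(buf.pop(0))
        transitions.append("SHIFT")
    return transitions, arcs

# heads for "the black cat sees us now", as in the CoNLL-U example
gold = {1: 3, 2: 3, 3: 4, 4: 0, 5: 4, 6: 4}
print(oracle(gold)[0])  # the gold transition sequence
```

At training time, each configuration is paired with the transition the oracle chose; at parsing time a classifier replaces the oracle. Since every word is shifted exactly once and reduced exactly once, a parse takes 2n transitions, hence O(n).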

Evaluation

2 main metrics (illustrated in the sketch on the next slide):

  • UAS (Unlabelled Attachment Score): what fraction of words is attached to the correct dependency head?
  • LAS (Labelled Attachment Score): what fraction of words is attached to the correct dependency head with an arc labelled with the correct relation type[^1]?
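Computing UAS and LAS (sketch)

A minimal illustration of both metrics over a single sentence. Real evaluation scripts also handle tokenization mismatches (and often exclude punctuation), which this sketch ignores.

```python
def attachment_scores(gold, pred):
    """UAS and LAS over one sentence.
    gold and pred are lists of (head, deprel) pairs, one per word."""
    n = len(gold)
    uas = sum(g[0] == p[0] for g, p in zip(gold, pred)) / n
    las = sum(g == p for g, p in zip(gold, pred)) / n
    return uas, las

# gold analysis of "the black cat sees us now" vs. a prediction that
# gets every head right but mislabels one arc
gold = [(3, "det"), (3, "amod"), (4, "nsubj"),
        (0, "root"), (4, "obj"), (4, "advmod")]
pred = [(3, "det"), (3, "amod"), (4, "nsubj"),
        (0, "root"), (4, "obj"), (4, "obl")]
print(attachment_scores(gold, pred))  # (1.0, 0.8333...)
```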

Specifics of UD parsing

Not just parsing per se

UD "parsers" typically do a lot more than just dependency parsing:

  • lemmatization (LEMMA column)
  • POS tagging (UPOS + XPOS)
  • morphological tagging (FEATS)
  • ...

Evaluation (UD-specific)

Some more specific metrics:

  • CLAS (Content-word LAS): LAS limited to content words (sketch on the next slide)
  • MLAS (Morphology-Aware LAS): CLAS that also uses the FEATS column
  • BLEX (Bi-Lexical dependency score): CLAS that also uses the LEMMA column
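Computing CLAS (sketch)

A rough sketch of the idea behind CLAS. The official CoNLL 2018 evaluation script computes precision, recall and F1, since gold and predicted content words may differ; this simplified version only scores against the gold content words. The set of functional relations below follows that script, but treat it as illustrative.

```python
# Relations treated as functional and thus excluded from CLAS
# (following the CoNLL 2018 shared task evaluation script)
FUNCTIONAL = {"aux", "case", "cc", "clf", "cop", "det", "mark"}

def clas(gold, pred):
    """CLAS over one sentence: LAS restricted to content words.
    gold and pred are lists of (head, deprel) pairs, one per word."""
    content = [i for i, (h, rel) in enumerate(gold)
               if rel not in FUNCTIONAL and rel != "punct"]
    return sum(gold[i] == pred[i] for i in content) / len(content)

gold = [(3, "det"), (3, "amod"), (4, "nsubj"),
        (0, "root"), (4, "obj"), (4, "advmod")]
pred = [(3, "det"), (3, "amod"), (4, "nsubj"),
        (0, "root"), (4, "obj"), (4, "obl")]
print(clas(gold, pred))  # 0.8: the word attached by "det" is excluded
```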

Evaluation script output

\small

Metric     | Precision |    Recall |  F1 Score | AligndAcc
-----------+-----------+-----------+-----------+-----------
Tokens     |    100.00 |    100.00 |    100.00 |
Sentences  |    100.00 |    100.00 |    100.00 |
Words      |    100.00 |    100.00 |    100.00 |
UPOS       |     98.36 |     98.36 |     98.36 |     98.36
XPOS       |    100.00 |    100.00 |    100.00 |    100.00
UFeats     |    100.00 |    100.00 |    100.00 |    100.00
AllTags    |     98.36 |     98.36 |     98.36 |     98.36
Lemmas     |    100.00 |    100.00 |    100.00 |    100.00
UAS        |     92.73 |     92.73 |     92.73 |     92.73
LAS        |     90.30 |     90.30 |     90.30 |     90.30
CLAS       |     88.50 |     88.34 |     88.42 |     88.34
MLAS       |     86.72 |     86.56 |     86.64 |     86.56
BLEX       |     88.50 |     88.34 |     88.42 |     88.34
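The table above is the output format of the evaluation script used in the CoNLL 2018 shared task (conll18_ud_eval.py). Assuming that script, a typical invocation looks like this (file names are placeholders):

```
python conll18_ud_eval.py -v gold.conllu system.conllu
```

where -v requests the full per-metric table rather than just the headline scores.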

Three generations of parsers

  1. MaltParser (Nivre et al., 2006): "classic" transition-based parser, data-driven but not NN-based
  2. UDPipe: neural transition-based parser; personal favorite
    • version 1 (Straka et al., 2016): solid and fast software, available anywhere
    • version 2 (Straka et al., 2018): much better performance, but slower and only available through an API
  3. MaChAmp (van der Goot et al., 2021): transformer-based toolkit for multi-task learning; works on any CoNLL-like data, is close to the SOTA, and is relatively easy to install and train

Your task (lab 3)

  1. annotate a small treebank for your language of choice (started)
  2. train a parser-tagger with MaChAmp on a reference UD treebank (tomorrow: installation)
  3. evaluate it on your treebank

Sources/further reading

  • chapters 18-19 of the January 2024 draft of Speech and Language Processing (Jurafsky & Martin) (full text available here)
  • unit 3-2 of Johansson & Kuhlmann's course "Deep Learning for Natural Language Processing" (slides and videos available here)
  • section 10.9.2 on parser evaluation from Aarne's course notes (on Canvas or here)

Papers describing the parsers

  • MaltParser: A Data-Driven Parser-Generator for Dependency Parsing (Nivre et al., 2006) (PDF here)
  • UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing (Straka et al., 2016) (PDF here)
  • UDPipe 2.0 Prototype at CoNLL 2018 UD Shared Task (Straka et al., 2018) (PDF here)
  • Massive Choice, Ample Tasks (MACHAMP): A Toolkit for Multi-task Learning in NLP (van der Goot et al., 2021) (PDF here)

[^1]: in UD: the DEPREL column