-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Analysis of the restricted corpus #57
Comments
only conjunction is: Paper and scissors both cut scissors has lemmatization problem, no concept found. paper is wrong concept, newspaper. cut is wrong concept, process only. want to distribute? enhanced CoreNLP does. |
13 copulas:
4 copulas are wrong as far as dependencies are concerned. |
19 negations marked:
all dependencies seem fine. However there are several semantic negations that are not marked.
and also:
|
Looking for the 20 nns is difficult, as the reps use NN for noun.
Wrong ones:
|
there are 4 dependencies "dep"
|
"acomp"=2 ===> falling asleep. wrong ones: a few kittens, few men, several children, |
particle verbs:
only one preposition: |
expletives
|
5 passive voices only, all correct.
|
advmodifiers: (34 in total)
Others:
|
actually in the whole corpus we have 112 adverbs ending in "ly" -- this is for future use in the nomlex for adjective/adverbs. they're attached. |
I do not understand why only the verb "sit" gives a different meaning to the auxiliary "to be". |
@vcvpaiva once more, so answer this puzzle we need an interactive implementation of UKB with a traces! We would need to see how the algorithm evolves during the computation to choose a sense A instead of a sense B for a given word W. |
1 occ of man mapping to Man, 111 occs mapping to Hominid? |
17 correct adjectives: empty, hungry, immobile some of the adjectives are past participles: scared, animated, bored which causes problems. |
only 50 nsubjs, kind of:
|
dobjs:
|
According to stats (https://github.com/own-pt/rte-sick/blob/master/expanded/conllu/len.3-5.conllu.stats.xml)
390 sentences (385 nsubj+5nsubjpass), 1 conjunction only, 4 dependencies 'dep', 13 copulas, 140 direct objects, 26 expletives, 19 negations and 20 noun-nouns.
1 prep and 9 particles.
The rest is:
"acomp"=2
"advmod"=34
"amod"= 20
"partmod"= 10
"pobj"= 1
also 30 ADJ, 45 ADV
"quantmod"= 3
The text was updated successfully, but these errors were encountered: