Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train jiweil and AdaGram on umbc #16

Open
makrai opened this issue Jun 13, 2016 · 13 comments
Open

train jiweil and AdaGram on umbc #16

makrai opened this issue Jun 13, 2016 · 13 comments
Assignees

Comments

@makrai
Copy link
Member

makrai commented Jun 13, 2016

No description provided.

@makrai makrai self-assigned this Jun 13, 2016
@makrai makrai changed the title train jiweil umbc train jiweil and AdaGram on umbc Jun 13, 2016
@makrai
Copy link
Member Author

makrai commented Jun 15, 2016

AdaGram finishes on Thursday

@makrai
Copy link
Member Author

makrai commented Jun 17, 2016

lefutott a jiweil az umbc-n, pontosabban OutOfMemoryError-t dobott, úgyhogy fenntartásokkal kell kezelni az outputot. Ha nagyon más jön ki, mint a többi nyevre, akkor meg kéne csinálni kisebb korpuszon

@gaebor
Copy link
Member

gaebor commented Jun 17, 2016

nessie-n próbáltad? ott van a legtöbb memória.

@makrai
Copy link
Member Author

makrai commented Jun 17, 2016

I did some mistake in moving the jiweil embedding to store, I will tell you when I finish

@makrai
Copy link
Member Author

makrai commented Jun 17, 2016

AdaGram has finished
real 3996m50.961s
not I do the postprocessing

@makrai
Copy link
Member Author

makrai commented Jun 17, 2016

a jiweil-t ügyesen kitöröltem, úgyhogy csinlom újra. Az AdaGram mindjárt a helyén lesz
/mnt/store/hlt/Language/English/Embed/multiprot/adagram/umbc-600-1epoch.mse

@DavidNemeskey
Copy link
Collaborator

DavidNemeskey commented Jun 17, 2016

Adagramra lefuttattam. Jó korrelációk több mindennel is (neela, huang). És @gaebor -nak igaza van, a logolt frekvenciákkal mind Spearmannel, mind Pearsonnal jó egyezést mutat.

@makrai
Copy link
Member Author

makrai commented Jun 20, 2016

a jiweil lehet, hogy nem lesz meg emberi időn belül, úgyhogy most átnézem a németet és a magyart, utána pedig lehet, hogy lefuttatok mindent egy angol webkorpuszon (ukWaC?)

@gaebor
Copy link
Member

gaebor commented Jun 20, 2016

@makrai miért akarnál ukWaC-on tanítani, az UMBC nem jó?

@makrai
Copy link
Member Author

makrai commented Jun 20, 2016

attól tartok, az UMBC-n nem készül el időben

@gaebor
Copy link
Member

gaebor commented Jun 20, 2016

az ukWaC miért készülne el gyorsabban?

@makrai
Copy link
Member Author

makrai commented Jun 20, 2016

mert hülye vagyok

store/hlt/Language/English/Crawl$ wc UMBC_Webbase/corpus.cleaned.without.pos.txt ukwac/UKWAC.spl 
   40586438  3338409198 18886715300 UMBC_Webbase/corpus.cleaned.without.pos.txt
   88214600  2247153469 12297438448 ukwac/UKWAC.spl

@gaebor
Copy link
Member

gaebor commented Jun 20, 2016

UMBC-ből van felezett is /mnt/store/hlt/Language/English/Crawl/UMBC_Webbase/umbc.even.paragraphs.txt vagy umbc.odd.paragraphs.txt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants