
NMT and TGNN Project Documentation with Anas Mohammad #267

Open
thangk opened this issue Nov 26, 2024 · 16 comments
@thangk
Collaborator

thangk commented Nov 26, 2024

A placeholder issue for tracking all future NMT and TGNN project milestones: issues, solutions, guides, results, and anything else related to the two projects with Anas Mohammad.

I'll update this first post as we progress and as needed.

@thangk thangk self-assigned this Nov 26, 2024
@thangk thangk added the documentation Improvements or additions to documentation label Nov 26, 2024
@Groovyfalafel

Excited to work under the supervision of Kap.

@thangk thangk changed the title NMT and TGNN Project Milestones with Anas Mohammad NMT and TGNN Project Documentation with Anas Mohammad Nov 26, 2024
@thangk
Collaborator Author

thangk commented Nov 29, 2024

@Groovyfalafel

Today's agenda:

  • explain the OpeNTF library and our specific part in the project
  • run the dblp toy dataset
  • get instructions on the tasks to work on next week (start the convs2s ablation study/experiments)

Can you come on Teams at 2 pm to work on these?

@Groovyfalafel

Sounds great. I'll be available at 3 pm; I'm busy until then.

@thangk
Collaborator Author

thangk commented Dec 2, 2024

@Groovyfalafel,

Can you re-run the transformer model? Before running, do the following (see the sketch after this list):

  • re-sync your forked repo, as I've updated _template.sh to use a toy dataset
  • re-duplicate a transformer model config, change train_steps to something like 20, and update the 3 lines below it to 10% of that value
  • set up a new bash script from the _template_v1.4.1.sh I've just updated
  • in the script, set the dataset to toy_dblp
  • re-run it
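
A rough command-line version of those steps (a minimal sketch; the remote name, config file names, run-script names, and the three config keys below train_steps are assumptions, not the repo's actual layout):

```bash
# Sketch only -- file and remote names below are assumptions, not the repo's actual layout.

# 1. Re-sync the forked repo with upstream
git fetch upstream
git merge upstream/main

# 2. Duplicate the transformer config and shorten the run
cp transformer.yml transformer_toy.yml
# In transformer_toy.yml set, e.g.:
#   train_steps: 20
#   valid_steps: 2            # hypothetical key names -- the point is that the
#   save_checkpoint_steps: 2  # three lines below train_steps become 10% of 20
#   report_every: 2

# 3. Create a run script from the updated template and point it at the toy dataset
cp _template_v1.4.1.sh run_transformer_toy_dblp.sh
# In run_transformer_toy_dblp.sh, set the dataset to toy_dblp

# 4. Re-run
bash run_transformer_toy_dblp.sh
```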

@Groovyfalafel

I will be doing so soon. So far I have:

  1. Tried to download the image from Docker Hub but ran into a few errors; I had to log in to Docker from the terminal to fix it.

  2. Tried running a bash script to run a model, but it displayed an error message indicating there is no such file. Found a solution on Stack Overflow telling me to install dos2unix. It worked, and I can now run bash scripts (the fix is sketched below).

  3. Tried running a model, but it freezes after a couple of seconds. I kept my PC open, yet it still wouldn't finish training; the process ran for hours with no result. Hopefully, after these fixes, it should work.
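
For reference, the fix in step 2 looked roughly like this (a sketch; it assumes a Debian/Ubuntu environment and a hypothetical script name):

```bash
# Sketch of the CRLF line-ending fix from step 2 (hypothetical script name).
sudo apt-get install dos2unix          # one-time install of the converter
dos2unix run_transformer_toy_dblp.sh   # convert Windows CRLF line endings to LF in place
bash run_transformer_toy_dblp.sh       # the "no such file" error should be gone
```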

@thangk
Collaborator Author

thangk commented Dec 3, 2024

@Groovyfalafel

Were you able to run the toy dblp until the end?

@Groovyfalafel

I have tried running it and implemented what you suggested, setting the bucket size to 128, yet I am still running into issues.

@thangk
Collaborator Author

thangk commented Dec 4, 2024

I have tried running it and implemented what you suggested, setting the bucket size to 128, yet I am still running into issues.

Anything in the non-error log?

@Groovyfalafel

Nothing unusual, no.

@thangk
Collaborator Author

thangk commented Dec 4, 2024

What errors do you get? Post what's in the non-error log, and also the last 20 lines or so of the error log, using code blocks here.
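
Something like this will grab them (a sketch; the log file paths are assumptions, so substitute whatever your run script actually writes):

```bash
# Hypothetical log paths -- substitute the files your run script writes.
cat output/nmt_toy_dblp.out.log          # the non-error log
tail -n 20 output/nmt_toy_dblp.err.log   # last 20 lines of the error log
```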

@Groovyfalafel

Groovyfalafel commented Dec 5, 2024

```
Loading indexes pickle from ./../data/preprocessed/dblp/toy.dblp.v12.json/indexes.pkl ...
It took 0.016460180282592773 seconds to load from the pickles.
It took 0.03437948226928711 seconds to load the sparse matrices.

Only one GPU detected. Using it (if CUDA is available).

Using device: cuda
Running for (dataset, model): (dblp, nmt_toy_dblp_second_test) ... 
			* corpus_1: 6381
[2024-12-03 21:51:45,573 INFO] Weighted corpora loaded so far:
			* corpus_1: 6382
[2024-12-03 21:51:45,577 INFO] Weighted corpora loaded so far:
			* corpus_1: 6383
[2024-12-03 21:51:45,580 INFO] Weighted corpora loaded so far:
			* corpus_1: 6384
[2024-12-03 21:51:45,584 INFO] Weighted corpora loaded so far:
			* corpus_1: 6385
[2024-12-03 21:51:45,587 INFO] Weighted corpora loaded so far:
			* corpus_1: 6386
[2024-12-03 21:51:45,591 INFO] Weighted corpora loaded so far:
			* corpus_1: 6387
[2024-12-03 21:51:45,594 INFO] Weighted corpora loaded so far:
			* corpus_1: 6388
[2024-12-03 21:51:45,597 INFO] Weighted corpora loaded so far:
			* corpus_1: 6389
[2024-12-03 21:51:45,600 INFO] Weighted corpora loaded so far:
			* corpus_1: 6390
[2024-12-03 21:51:45,603 INFO] Weighted corpora loaded so far:
			* corpus_1: 6391
[2024-12-03 21:51:45,606 INFO] Weighted corpora loaded so far:
			* corpus_1: 6392
[2024-12-03 21:51:45,609 INFO] Weighted corpora loaded so far:
			* corpus_1: 6393
[2024-12-03 21:51:45,612 INFO] Weighted corpora loaded so far:
			* corpus_1: 6394
[2024-12-03 21:51:45,615 INFO] Weighted corpora loaded so far:
			* corpus_1: 6395
[2024-12-03 21:51:45,619 INFO] Weighted corpora loaded so far:
			* corpus_1: 6396
[2024-12-03 21:51:45,621 INFO] Weighted corpora loaded so far:
			* corpus_1: 6397
[2024-12-03 21:51:45,625 INFO] Weighted corpora loaded so far:
			* corpus_1: 6398
[2024-12-03 21:51:45,627 INFO] Weighted corpora loaded so far:
			* corpus_1: 6399
[2024-12-03 21:51:45,631 INFO] Weighted corpora loaded so far:
			* corpus_1: 6400
```

@thangk
Collaborator Author

thangk commented Dec 6, 2024

@Groovyfalafel

Can you lower the bucket size even more, to something like 16, and then retry?
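
For example (a sketch; the config and script names are assumed, as is the option key being bucket_size in the duplicated yml):

```bash
# Assumes the duplicated config is transformer_toy.yml and the option key is bucket_size.
sed -i 's/^bucket_size:.*/bucket_size: 16/' transformer_toy.yml
bash run_transformer_toy_dblp.sh
```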

@Groovyfalafel

Froze again, this time ending at line 3179.

@thangk
Collaborator Author

thangk commented Dec 9, 2024

Froze again, this time ending at line 3179.

And nothing in the non-error log?

@Groovyfalafel

No, nothing new.

@thangk
Collaborator Author

thangk commented Dec 13, 2024

@Groovyfalafel

Okay, we'll try something else. Can you try using another model, either RNN or CNN (convs2s)? Duplicate the model file like the Transformer yml, and the rest is the same (i.e., make the run script for it). Set the bucket size to a low number again, like 8 or 16. A rough sketch follows below.
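
(A sketch mirroring the transformer setup from earlier in this thread; all file names here are assumptions.)

```bash
# Sketch of the convs2s setup -- file names are assumptions mirroring the transformer run.
cp transformer_toy.yml convs2s_toy.yml
# In convs2s_toy.yml: switch the encoder/decoder types to the CNN (convs2s)
# or RNN variants and set bucket_size to something small like 8 or 16.
cp run_transformer_toy_dblp.sh run_convs2s_toy_dblp.sh
# Point the new script at convs2s_toy.yml, then:
bash run_convs2s_toy_dblp.sh
```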
