BestRQ implementation #63
base: main
Conversation
Not sure if there are problems in the code or if I misunderstood what it is doing 🤷
Does it make sense to include this in i6_models? What exactly is our policy w.r.t. what we want to have here? I thought i6_models is intended for models (and maybe other functions) that are supposed to be used by a larger number of people among us, but not for research code where you want to try something out. Here, as I understand it, this is mostly adapted from Fairseq, so maybe it's already well tested, but I don't know. Did you test this?
Co-authored-by: michelwi <[email protected]>
I think BestRQ is a very general pre-training architecture that we might also use later in the group, so I am adding it here. I ran it with a small dataset and it seems to work.
I have no problem also adding some code that is a bit more "research-y" to the repo. But of course, once it's in, we would require that new variants get a separate class name so as not to break the old behavior (bugs are the exception, of course). If we see that some model / part of a model is changing rapidly, we might want to delay things until it settles down a bit, but in this instance I would expect the code from Fairseq to broadly work.
Add the two components, i.e. the mask and the quantiser, for BestRQ.
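To make the two components concrete, here is a minimal PyTorch sketch of what they typically look like in BestRQ-style pre-training: a frozen random projection plus a frozen codebook that turns input features into discrete target labels, and a span mask selecting the frames to predict. This is an illustrative sketch only, not the actual i6_models code from this PR; all class names, dimensions, and hyperparameters below are assumptions.

```python
import torch


class RandomProjectionQuantizer(torch.nn.Module):
    """BestRQ-style quantiser (hypothetical sketch, not the i6_models API).

    Both the projection matrix and the codebook are initialised randomly
    and frozen: they are registered as buffers and never trained.
    """

    def __init__(self, input_dim: int, codebook_dim: int, codebook_size: int):
        super().__init__()
        # fixed random projection from feature space to codebook space
        proj = torch.empty(input_dim, codebook_dim)
        torch.nn.init.xavier_uniform_(proj)
        self.register_buffer("projection", proj)
        # fixed random codebook, l2-normalised entries
        codebook = torch.nn.functional.normalize(
            torch.randn(codebook_size, codebook_dim), dim=-1
        )
        self.register_buffer("codebook", codebook)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        # features: [B, T, F] -> projected: [B, T, D], l2-normalised
        projected = torch.nn.functional.normalize(
            features @ self.projection, dim=-1
        )
        # nearest codebook entry per frame -> discrete target labels [B, T]
        codebook = self.codebook.unsqueeze(0).expand(projected.shape[0], -1, -1)
        distances = torch.cdist(projected, codebook)
        return distances.argmin(dim=-1)


def random_span_mask(
    batch: int, time: int, mask_prob: float = 0.01, mask_len: int = 4
) -> torch.Tensor:
    """Sample mask start frames, then extend each start to a span.

    Returns a boolean [B, T] mask; True marks frames whose quantiser
    labels the model must predict. mask_prob / mask_len are assumptions.
    """
    starts = torch.rand(batch, time) < mask_prob
    mask = starts.clone()
    for offset in range(1, mask_len):
        mask[:, offset:] |= starts[:, :-offset]
    return mask
```

During pre-training, the masked frames would be replaced (e.g. by noise or a learned embedding) before the encoder, and the training loss is a cross-entropy between the encoder output at masked positions and the quantiser labels computed from the unmasked features.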