-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Attentive interface #802
base: autoregressive_refactor
Are you sure you want to change the base?
Attentive interface #802
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Jak se k tomu má encoders/attentive.py
? Jedno z toho bych přejmenoval (spíš ten enkodér) a taky bych tomu enkodéru oddědil věci, který jsou tady vyčleněný, pokud to dává smysl.
A rekuretní dekodér není attentive? |
Rekurentni je attentive, ale ten tedka predelavam v dalsim PR, tak aby pouzival stejnou implementaci enkoder attentionu jako transformer. |
Vubec nesouvisi. Ono je to tezky: Ondrovo Tezky je na tom to, ze bychom museli zmenit jeho terminy, aby to nevnaselo zmatek, ale pak by to tolik nesedelo na terminy v tom jeho clanku. Nechal bych to jako TODO pro refactor attentionu / attention objektu. |
IMHO stačí přejmenovat ten Ondrovo soubor nebo třídu. Něco jako self-attentive temporal encoder nebo tak něco. |
Jo, neco vymyslim. Edit: Prejmenovano na Structured |
806f6a2
to
d67d5c1
Compare
dee6855
to
60164f9
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Napřed prosim vyřešit vztah k #786 plus poznámky, který jsem dal k tomu, rebasnout a pak na to mrknu ještě jednou
Nejvdřív je potřeba pořešit #796, na kterým je tohleto naroubovaný. |
3495b1d
to
cd1e15a
Compare
3e5ab5a
to
89eb9c2
Compare
6ed29a5
to
c446c9c
Compare
Autoregressive decoders refactor
Unify RNN encoders
ping |
8c1c458
to
3fbbcf7
Compare
neuralmonkey/dataset.py
Outdated
@@ -95,6 +95,84 @@ def __init__(self, | |||
# pylint: enable=too-few-public-methods | |||
|
|||
|
|||
def _bucket_boundaries(max_length, min_length=8, length_bucket_step=1.1): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
typy
@@ -0,0 +1,167 @@ | |||
"""TODO.""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
?
the attention. | ||
|
||
TODO: | ||
Generalize the attention. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
co je tohle todo?
queries, enc_states, enc_masks, self.n_heads_enc, | ||
self.n_heads_hier, attn_dropout_cbs, dropout_cb) | ||
|
||
# TODO: remove this - this is already checked in the constructor |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
odstranit? nebo smazat todo?
@@ -132,7 +132,7 @@ def __init__(self, | |||
postprocess: The postprocessor to apply to the output data. | |||
""" | |||
check_argument_types() | |||
BaseRunner[BeamSearchDecoder].__init__(self, output_series, decoder) | |||
super().__init__(output_series, decoder) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tyhle změny už jsem dělal a jsou zamergovaný, proč to tu je? je to narebasovaný správně?
@@ -9,5 +9,5 @@ python_speech_features | |||
pygments | |||
rouge==0.2.1 | |||
typeguard | |||
sacrebleu | |||
sacrebleu==1.3.1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
to tu nemá bejt. nová verze sacrebleu už je kompatibilní - byl to jejich bug
@@ -9,5 +9,5 @@ python_speech_features | |||
pygments | |||
rouge==0.2.1 | |||
typeguard | |||
sacrebleu | |||
sacrebleu==1.3.1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ditto
@@ -1 +1 @@ | |||
mypy | |||
mypy==0.660 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pryč
3fbbcf7
to
209b68b
Compare
Fixed Dataset.subset when buffer_size is None
…encoder attention
209b68b
to
38c0bf8
Compare
Rebase opraven. |
ten pull request je nastavenej na mergování do branche |
Moved attention-related attributes/methods to a separate class Attentive. Every decoder that requires computing attention against the encoders should inherit this class.