Is there anything that makes training a translation task easy?

hok@lemmy.dbzer0.com · 1 year ago

Thanks for the tips. After doing a bunch of searching, I found that what I needed was BPE, or byte-pair encoding. This allows the token set to contain sub-word sequences, which lets the tokenizer represent a unique constant like 0x0373 as ['__sow', '0x', '03', '73', '__eow'].

hok@lemmy.dbzer0.com · 1 year ago

Thanks, the quickstart guide was straightforward to follow. Do you have any suggestions on how to do word splitting with code, if any? For example, on a test run, I found that the model was not able to synthesize unique constants correctly even though this test run consisted only of obvious “a to b” relationships.

hok@lemmy.dbzer0.com · 1 year ago

Is there anything that makes training a translation task easy?