Few-shot Sequence Learning with Transformers