TinyTL: Reduce Memory, Not Parameters for Efficient On-Device LearningHan CaiChuang Ganet al.2020NeurIPS 2020
HAT: Hardware-Aware Transformers for Efficient Neural Machine TranslationHanrui WangZhanghao Wuet al.2020ACL 2020