Improving task-agnostic BERT distillation with layer mapping search

Jiao, X.; Chang, H.; Yin, Y.; Shang, L.; Jiang, X.; Chen, X.; Li, L.; Wang, F.; Liu, Q.

Neurocomputing 461: 194-203

2021


ISSN/ISBN: 0925-2312
DOI: 10.1016/j.neucom.2021.07.050
Accession: 084710732

Full-Text Article emailed within 0-6 h
Payments are secure & encrypted
Powered by Stripe
Powered by PayPal