A Roadmap for Big Model

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields. At present, there is a lack of research work that sorts out th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2022-04
Hauptverfasser: Sha Yuan, Zhao, Hanyu, Zhao, Shuai, Leng, Jiahong, Liang, Yangxiao, Wang, Xiaozhi, Yu, Jifan, Lv, Xin, Zhou, Shao, He, Jiaao, Lin, Yankai, Xu, Han, Liu, Zhenghao, Ding, Ning, Rao, Yongming, Gao, Yizhao, Zhang, Liang, Ding, Ming, Fang, Cong, Wang, Yisen, Long, Mingsheng, Zhang, Jing, Dong, Yinpeng, Pang, Tianyu, Cui, Peng, Huang, Lingxiao, Zheng, Liang, Shen, Huawei, Zhang, Hui, Zhang, Quanshi, Dong, Qingxiu, Tan, Zhixing, Wang, Mingxuan, Wang, Shuo, Long, Zhou, Li, Haoran, Bao, Junwei, Pan, Yingwei, Zhang, Weinan, Zhou, Yu, Yan, Rui, Shi, Chence, Xu, Minghao, Zhang, Zuobai, Wang, Guoqiang, Pan, Xiang, Li, Mengjie, Chu, Xiaoyu, Yao, Zijun, Zhu, Fangwei, Cao, Shulin, Xue, Weicheng, Ma, Zixuan, Zhang, Zhengyan, Hu, Shengding, Qin, Yujia, Xiao, Chaojun, Zeng, Zheni, Cui, Ganqu, Chen, Weize, Zhao, Weilin, Yao, Yuan, Li, Peng, Zheng, Wenzhao, Zhao, Wenliang, Wang, Ziyi, Zhang, Borui, Nanyi Fei, Hu, Anwen, Ling, Zenan, Li, Haoyang, Cao, Boxi, Han, Xianpei, Zhan, Weidong, Chang, Baobao, Sun, Hao, Deng, Jiawen, Zheng, Chujie, Li, Juanzi, Hou, Lei, Cao, Xigang, Zhai, Jidong, Liu, Zhiyuan, Sun, Maosong, Lu, Jiwen, Lu, Zhiwu, Qin, Jin, Song, Ruihua, Ji-Rong, Wen, Lin, Zhouchen, Wang, Liwei, Su, Hang, Zhu, Jun, Sui, Zhifang, Zhang, Jiajun, Liu, Yang, He, Xiaodong, Huang, Minlie, Tang, Jian, Tang, Jie
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields. At present, there is a lack of research work that sorts out the overall progress of BMs and guides the follow-up research. In this paper, we cover not only the BM technologies themselves but also the prerequisites for BM training and applications with BMs, dividing the BM review into four parts: Resource, Models, Key Technologies and Application. We introduce 16 specific BM-related topics in those four parts, they are Data, Knowledge, Computing System, Parallel Training System, Language Model, Vision Model, Multi-modal Model, Theory&Interpretability, Commonsense Reasoning, Reliability&Security, Governance, Evaluation, Machine Translation, Text Generation, Dialogue and Protein Research. In each topic, we summarize clearly the current studies and propose some future research directions. At the end of this paper, we conclude the further development of BMs in a more general view.
ISSN:2331-8422