Training Language Models with Memory Augmentation

Type: Preprint

Publication Date: 2022-01-01

Citations: 3

DOI: https://doi.org/10.48550/arxiv.2205.12674

Locations

  • arXiv (Cornell University) - View
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat Training Language Models with Memory Augmentation 2022 Zexuan Zhong
Tao Lei
Danqi Chen
+ PEMA: Plug-in External Memory Adaptation for Language Models 2023 Hyunjin Kim
Young Jin Kim
JinYeong Bak
+ Pluggable Neural Machine Translation Models via Memory-augmented Adapters 2023 Yuzhuang Xu
Shuo Wang
Peng Li
Xuebo Liu
Xiaolong Wang
Weidong Liu
Yang Liu
+ LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models 2023 Yixuan Weng
Zhiqi Wang
Huanxuan Liao
Shizhu He
Shengping Liu
Kang Liu
Jun Zhao
+ PDF Chat Continual Learning for Large Language Models: A Survey 2024 Tongtong Wu
Linhao Luo
Yuan-Fang Li
Shirui Pan
Thuy-Trang Vu
Gholamreza Haffari
+ Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch 2023 Le Yu
Bowen Yu
Haiyang Yu
Fei Huang
Yongbin Li
+ PDF Chat Pre-training Small Base LMs with Fewer Tokens 2024 Sunny Sanyal
Sujay Sanghavi
Alexandros G. Dimakis
+ SAS: Self-Augmented Strategy for Language Model Pre-training 2021 Yifei Xu
Jingqiao Zhang
Ru He
Liangzhu Ge
Chao Yang
Cheng Yang
Ying Wu
+ PDF Chat On the Effectiveness of Incremental Training of Large Language Models 2024 Miles Q. Li
Benjamin C. M. Fung
Shih‐Chia Huang
+ PDF Chat MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting 2024 Tianhao Li
Shangjie Li
Binbin Xie
Deyi Xiong
Baosong Yang
+ PDF Chat When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering 2024 Stephen J. Choi
William Gazeley
+ Achieving Peak Performance for Large Language Models: A Systematic Review 2024 Zhyar Rzgar K. Rostam
Såndor Szénåsi
Gåbor Kertész
+ PDF Chat Large Language Models: A Survey 2024 Shervin Minaee
Tomas Mikolov
Narjes Nikzad-Khasmakhi
Meysam Chenaghlu
Richard Socher
Xavier Amatriain
Jianfeng Gao
+ PDF Chat CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory 2024 Zexue He
Leonid Karlinsky
Donghyun Kim
Julian McAuley
Dmitry Krotov
Rogério Feris
+ Prompting Neural Machine Translation with Translation Memories 2023 Abudurexiti Reheman
Tao Zhou
Yingfeng Luo
Di Yang
Tong Xiao
Jingbo Zhu
+ PDF Chat Prompting Neural Machine Translation with Translation Memories 2023 Abudurexiti Reheman
Tao Zhou
Yingfeng Luo
Di Yang
Tong Xiao
Jingbo Zhu
+ MaLA-500: Massive Language Adaptation of Large Language Models 2024 Peiqin Lin
Shaoxiong Ji
Jörg Tiedemann
André F. T. Martins
Hinrich SchĂŒtze
+ PDF Chat Sparsity-Accelerated Training for Large Language Models 2024 Da Ma
Lu Chen
Pengyu Wang
Hongshen Xu
Hanqi Li
Liangtai Sun
Su Zhu
Shuai Fan
Kai Yu
+ PDF Chat Memory-augmented Neural Machine Translation 2017 Yang Feng
Shiyue Zhang
Andi Zhang
Dong Wang
Andrew Abel
+ PDF Chat CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models 2023 Aitor Ormazabal
Mikel Artetxe
Eneko Agirre

Works Cited by This (0)

Action Title Year Authors