Last updated: 2023-04-25
Table of contents
- LLaMA
- text-davinci-003
- Alpaca
Foundation model
Fine-tuned model
Evaluation benchmarks
- BoolQ: a question answering dataset of yes/no questions, containing 15,942 examples.
- PIQA: Physical Interaction QA, a dataset for physical commonsense reasoning.
- SIQA: Social Interaction QA, a dataset for commonsense reasoning about social situations.
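Benchmarks like BoolQ score a model by simple accuracy over its yes/no predictions. A minimal sketch of that scoring loop is below; the field names (`question`, `passage`, `answer`) follow the published BoolQ format, but the two examples themselves are made up for illustration.

```python
# Two made-up BoolQ-style examples (real BoolQ has 15,942 of these).
examples = [
    {"question": "is the sky blue on a clear day", "passage": "...", "answer": True},
    {"question": "do penguins fly", "passage": "...", "answer": False},
]

def accuracy(preds, examples):
    """Fraction of yes/no predictions matching the gold answers."""
    correct = sum(p == ex["answer"] for p, ex in zip(preds, examples))
    return correct / len(examples)

print(accuracy([True, True], examples))  # one of two correct -> 0.5
```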
LLaMA
Developed by Meta
License: noncommercial license
Parameters
- 65B parameters, trained on 1.4T tokens
- 33B parameters, trained on 1.4T tokens
- 13B parameters, trained on 1T tokens
- 7B parameters, trained on 1T tokens
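One way to read the size list above is as a tokens-per-parameter ratio: the smaller LLaMA models see far more tokens per parameter than the larger ones. A quick sketch of that arithmetic (sizes and token counts taken from the list above):

```python
# (parameter count, training tokens) for each LLaMA size listed above.
configs = {"65B": (65e9, 1.4e12), "33B": (33e9, 1.4e12),
           "13B": (13e9, 1.0e12), "7B": (7e9, 1.0e12)}

for name, (params, tokens) in configs.items():
    # e.g. 65B: 1.4e12 / 65e9 ~= 21.5 tokens per parameter
    print(f"{name}: {tokens / params:.1f} tokens per parameter")
```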
Training dataset (English)
- CCNet [67%]
- C4 [15%] # 783 GB
- GitHub [4.5%] # 328 GB
- Wikipedia [4.5%] # 83 GB
- Books [4.5%] # 85 GB
- ArXiv [2.5%]
- Stack Exchange [2%] # 78 GB
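The percentages above sum to exactly 100%, and multiplying them against a 1.4T-token run gives a rough per-source token budget. A sketch of that sanity check (assuming the listed percentages are sampling proportions over training tokens):

```python
# Sampling mixture from the list above (percent of training tokens).
mixture = {"CCNet": 67.0, "C4": 15.0, "GitHub": 4.5,
           "Wikipedia": 4.5, "Books": 4.5, "ArXiv": 2.5,
           "Stack Exchange": 2.0}
total_tokens = 1.4e12  # token budget of the largest runs

# The mixture should cover exactly 100% of training tokens.
assert abs(sum(mixture.values()) - 100.0) < 1e-9

for source, pct in mixture.items():
    print(f"{source}: ~{pct / 100 * total_tokens:.2e} tokens")
```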
text-davinci-003
Developed by OpenAI; a GPT-3 series model.
Builds on InstructGPT and is trained with humans in the loop
via reinforcement learning from human feedback (RLHF).
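At the core of RLHF is a reward model fit to human preference comparisons: given scalar scores for two candidate responses, the probability that the first is preferred is modeled as a sigmoid of the score difference (a Bradley-Terry model). A minimal sketch, with made-up reward scores:

```python
import math

def preference_prob(r_a, r_b):
    """P(response A preferred over B) under a Bradley-Terry model:
    sigmoid of the reward-score difference."""
    return 1.0 / (1.0 + math.exp(-(r_a - r_b)))

# Equal scores -> 50/50; a higher-scored response is preferred more often.
print(preference_prob(0.0, 0.0))
print(preference_prob(2.0, 0.0))
```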
Alpaca
An instruction-following language model,
fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations.
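Each of the 52K demonstrations is serialized into a single training prompt before fine-tuning. The sketch below formats one demonstration; the template resembles the one released with Alpaca but is reproduced from memory, so treat the exact wording as an assumption.

```python
def format_demo(instruction, output, input_text=""):
    """Render one instruction-following demonstration as a training prompt.
    Demonstrations with an optional input context get an extra section."""
    header = ("Below is an instruction that describes a task. "
              "Write a response that appropriately completes the request.")
    if input_text:
        return (f"{header}\n\n### Instruction:\n{instruction}\n\n"
                f"### Input:\n{input_text}\n\n### Response:\n{output}")
    return (f"{header}\n\n### Instruction:\n{instruction}\n\n"
            f"### Response:\n{output}")

print(format_demo("Name three primary colors.", "Red, blue, and yellow."))
```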