Improving Language Understanding by Generative Pre-Training (GPT): Difference between revisions

1,031 bytes added, 23 February 2023

Created page with "===Introduction=== In June 2018, OpenAI introduced GPT-1, a language model that combined unsupervised pre-training with the transformer architecture to achieve significant progress in natural language understanding. The team fine-tuned the pre-trained model for specific tasks and found that pre-training helped it perform well on various NLP tasks with minimal fine-tuning. GPT-1 was pre-trained on the BooksCorpus dataset and used a decoder-only transformer with masked self-attention, totaling 117 million parameters,..."

Daikon Radish



Revision as of 16:01, 23 February 2023