Jump to content

Page history

Improving Language Understanding by Generative Pre-Training (GPT)

2 March 2023

23 February 2023

Daikon Radish
no edit summary
16:03
+2,343
Daikon Radish
Created page with "===Introduction=== In June 2018, OpenAI introduced GPT-1, a language model that combined unsupervised pre-training with the transformer architecture to achieve significant progress in natural language understanding. The team fine-tuned the model for specific tasks and found that pre-training helped it perform well on various NLP tasks with minimal fine-tuning. GPT-1 used the BooksCorpus dataset and self-attention in the transformer's decoder with 117 million parameters,..."
16:01
+1,031

Retrieved from "http:///wiki/Special:History/Improving_Language_Understanding_by_Generative_Pre-Training_(GPT)"