ChatGPT: Difference between revisions

m
Text replacement - "|List of ChatGPT Plugins" to "|Custom GPTs"
m (Text replacement - "|List of ChatGPT Plugins" to "|Custom GPTs")
 
(23 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{see also|ChatGPT Guides}}
{{see also|Custom GPTs|ChatGPT Guides}}
==Introduction==
==Introduction==
[[File:Gpt4-122.jpg|thumb|Figure 1: ChatGPT user interface. Source: OpenAI.]]
[[File:Gpt4-122.jpg|thumb|Figure 1: ChatGPT user interface. Source: OpenAI.]]
Developed by the [[AI]] research lab [[OpenAI]], ChatGPT is a [[large language model]] ([[LLM]]) that generates text based on an [[input]] ([[prompt]]) from a user (figure 1). It interacts in a conversational way, answering follow-up questions, admitting to its mistakes, refusing inappropriate requests, and challenging incorrect premises. <ref name="”1”">OpenAI (2022). ChatGPT: Optimizing language models for dialogue. OpenAI. https://openai.com/blog/chatgpt/</ref> <ref name="”2”">Kelly, SM (2022). This AI chatbot is dominating social media with its frighteningly good essays. CNN International. https://edition.cnn.com/2022/12/05/tech/chatgpt-trnd/index.html</ref> <ref name="”3”">Edwards, B (2022). No Linux? No problem. Just get AI to hallucinate it for you. Ars Technica. https://arstechnica.com/information-technology/2022/12/openais-new-chatbot-can-hallucinate-a-linux-shell-or-calling-a-bbs/</ref> The LLM was trained on a massive amount of information <ref name="”2”" /> and it's a derivative of [[InstructGPT]], a program trained to follow natural language instructions in a prompt and give an elaborate response. <ref name="”1”" />
Developed by the [[AI]] research lab [[OpenAI]], [[ChatGPT]] is a [[large language model]] ([[LLM]]) in the [[GPT|GPT series]] that generates text based on an [[input]] ([[prompt]]) from a user (figure 1). It interacts in a conversational way, answering follow-up questions, admitting to its mistakes, refusing inappropriate requests, and challenging incorrect premises. <ref name="”1”">OpenAI (2022). ChatGPT: Optimizing language models for dialogue. OpenAI. https://openai.com/blog/chatgpt/</ref> <ref name="”2”">Kelly, SM (2022). This AI chatbot is dominating social media with its frighteningly good essays. CNN International. https://edition.cnn.com/2022/12/05/tech/chatgpt-trnd/index.html</ref> <ref name="”3”">Edwards, B (2022). No Linux? No problem. Just get AI to hallucinate it for you. Ars Technica. https://arstechnica.com/information-technology/2022/12/openais-new-chatbot-can-hallucinate-a-linux-shell-or-calling-a-bbs/</ref> The LLM was trained on a massive amount of information <ref name="”2”" /> and it's a derivative of [[InstructGPT]], a program trained to follow natural language instructions in a prompt and give an elaborate response. <ref name="”1”" />


OpenAI's ChatGPT was introduced at the end of November 2022 during a public demo. <ref name="”1”" /> <ref name="”6”">Wiggers, K (2022). While anticipation builds for GPT-4, OpenAI quietly releases GPT-3.5. TechCrunch. https://techcrunch.com/2022/12/01/while-anticipation-builds-for-gpt-4-openai-quietly-releases-gpt-3-5/</ref> In the following days, it was already considered the best [[artificial intelligence]] ([[AI]]) chatbot ever released to the general public. <ref name="”4”">Koose, K (2022). The Brilliance and Weirdness of ChatGPT. The New York Times. https://www.nytimes.com/2022/12/05/technology/chatgpt-ai-twitter.html</ref> Its initial introduction was, according to the AI research lab, "to get users’ feedback and learn about its strengths and weaknesses." They are encouraged to report on problematic model outputs through the UI and false positives and negatives. During this phase, ChatGPT could be used for free at chat.openai.com. <ref name="”1”" />
OpenAI's ChatGPT was introduced at the end of November 2022 during a public demo. <ref name="”1”" /> <ref name="”6”">Wiggers, K (2022). While anticipation builds for GPT-4, OpenAI quietly releases GPT-3.5. TechCrunch. https://techcrunch.com/2022/12/01/while-anticipation-builds-for-gpt-4-openai-quietly-releases-gpt-3-5/</ref> In the following days, it was already considered the best [[artificial intelligence]] ([[AI]]) chatbot ever released to the general public. <ref name="”4”">Koose, K (2022). The Brilliance and Weirdness of ChatGPT. The New York Times. https://www.nytimes.com/2022/12/05/technology/chatgpt-ai-twitter.html</ref> Its initial introduction was, according to the AI research lab, "to get users’ feedback and learn about its strengths and weaknesses." They are encouraged to report on problematic model outputs through the UI and false positives and negatives. During this phase, ChatGPT could be used for free at chat.openai.com. <ref name="”1”" />
Line 20: Line 20:
[[File:GPT Training process.png|thumb|Figure 2: General overview of the training process using reinforcement learning from human feedback. Source: OpenAI.]]
[[File:GPT Training process.png|thumb|Figure 2: General overview of the training process using reinforcement learning from human feedback. Source: OpenAI.]]


The ChatGPT model was trained using [[Reinforcement Learning]] from Human Feedback (RLHF) following the same methods as InstructGPT (figure 2); only with small differences in the data collection setup. Training for an initial model was made using supervised fine-tuning where human AI trainers had "conversations in which they played both sides—the user and an AI assistant." <ref name="”1”" />
The ChatGPT model was trained using [[Reinforcement Learning from Human Feedback]] ([[RLHF]]) following the same methods as InstructGPT (figure 2); only with small differences in the data collection setup. Training for an initial model was made using supervised fine-tuning where human AI trainers had "conversations in which they played both sides—the user and an AI assistant." <ref name="”1”" />


Reinforcement learning uses a reward model for AI training. This was done by collecting comparison data consisting of two or more model responses ranked by quality. According to the official blog of OpenAI, "to collect this data, we took conversations that AI trainers had with the chatbot. We randomly selected a model-written message, sampled several alternative completions, and had AI trainers rank them. Using these reward models, we can fine-tune the model using Proximal Policy Optimization. We performed several iterations of this process." <ref name="”1”" />
Reinforcement learning uses a reward model for [[training|AI training]]. This was done by collecting comparison data consisting of two or more model responses ranked by quality. According to the official blog of OpenAI, "to collect this data, we took conversations that AI trainers had with the chatbot. We randomly selected a model-written message, sampled several alternative completions, and had AI trainers rank them. Using these reward models, we can fine-tune the model using Proximal Policy Optimization. We performed several iterations of this process." <ref name="”1”" />


GPT-3.5 was trained on a mix of text and code published before Q4, 2021. <ref name="”6”" /> Both ChatGPT (a fine-tuned version of a model in the GPT-3.5 series) and GPT-3.5 were trained on an Azure AI supercomputing infrastructure. <ref name="”1”" /> The model on which the chatbot is based, text-davinci-003, can handle more complex instructions with increased output quality and overall better in long-form writing (around 65% longer outputs than text-davinci-002). It also has fewer limitations (e.g. a reduction in "hallucinations") than previous versions and scores higher on human preference rating. <ref name="”6”" />
GPT-3.5 was trained on a mix of text and code published before Q4, 2021. <ref name="”6”" /> Both ChatGPT (a fine-tuned version of a model in the GPT-3.5 series) and GPT-3.5 were trained on an Azure AI supercomputing infrastructure. <ref name="”1”" /> The model on which the chatbot is based, text-davinci-003, can handle more complex instructions with increased output quality and overall better in long-form writing (around 65% longer outputs than text-davinci-002). It also has fewer limitations (e.g. a reduction in "hallucinations") than previous versions and scores higher on human preference rating. <ref name="”6”" />
Line 36: Line 36:
==Applications==
==Applications==


OpenAI's ChatGPT has myriad applications, from generating computer code, to writing jokes, explaining scientific concepts in different levels of complexity, to writing college-level essays. Besides its regular text composition skills, it can also provide assistance, by helping programmers spot and fix errors in code or answer questions made to it, analogous to a Google search. <ref name="”4”" /> Indeed, it was denominated as a possible "Google Killer". <ref name="”9”">Ansari, T (2022). These 8 potential use cases of ChatGPT will blow your mind!​​​​​​​. Analytics India Magazine. https://analyticsindiamag.com/these-8-potential-use-cases-of-chatgpt-will-blow-your-mind/</ref> Due to ChatGPT remembering what a user has written before, some have suggested that it could be possible to create personalized therapy bots. <ref name="”4”" />
OpenAI's ChatGPT has myriad applications, from generating computer code, to writing jokes, explaining scientific concepts in different levels of complexity, to writing college-level essays. Besides its regular text composition skills, it can also provide assistance, by helping programmers spot and fix errors in code or answer questions made to it, analogous to a [[Google]] search. <ref name="”4”" /> Indeed, it was denominated as a possible "Google Killer". <ref name="”9”">Ansari, T (2022). These 8 potential use cases of ChatGPT will blow your mind!​​​​​​​. Analytics India Magazine. https://analyticsindiamag.com/these-8-potential-use-cases-of-chatgpt-will-blow-your-mind/</ref> Due to ChatGPT remembering what a user has written before, some have suggested that it could be possible to create personalized therapy bots. <ref name="”4”" />


Overall, the potential of this application is enormous, possibly having an impact on different business sectors.
Overall, the potential of this application is enormous, possibly having an impact on different business sectors.
*[[AI prompt]] generation, for art creation in models like [[DALL-E]] 2 or [[Midjourney]].
*[[AI prompt]] generation, for art creation in models like [[DALL-E]] 2 or [[Midjourney]].
*Act like a virtual cloud, stringing together cloud services to achieve complex tasks.
*Act like a virtual cloud, stringing together cloud services to achieve complex tasks.
*Content generation, changing how content is created for thousands of companies.
*[[Content generation]], changing how content is created for thousands of companies.
*Possible impact on education tech by answering to doubts.
*Possible impact on education tech by answering to doubts.
*Sort, manage, and organize unstructured data.
*Sort, manage, and organize unstructured data.
Line 74: Line 74:
*Faster response times
*Faster response times
*Access to new features and improvements earlier
*Access to new features and improvements earlier
==ChatGPT API==
ChaptGPT API was released on March 1st, 2023 alongside the release of [[Whisper]] API.<ref name="”11”">Introducing ChatGPT and Whisper APIs https://openai.com/blog/introducing-chatgpt-and-whisper-apis</ref>
==ChatGPT Plugins==
{{see also|Custom GPTs}}
[[ChatGPT Plugins]] can extend its capabilities by providing access to recent or specific information not from [[training data]] and allowing it to access external [[APIs]]. There are some plugins created by [[OpenAI]] like [[browsing]], [[code interpretation]] and [[retrieval]]. Developers can also create their own plugins for ChatGPT, and efforts are underway to develop open standards for AI-facing interfaces.
Browsing capabilities, code interpreters, retrieval plugins, and third-party plugins are being explored to expand the range of applications and provide more relevant and up-to-date information for ChatGPT.
==Guides==
{{see also|ChatGPT Guides}}
{{:ChatGPT Guides}}


==Uses==
==Uses==
{{see also|ChatGPT Uses}}
{{see also|ChatGPT Uses}}
{{:ChatGPT Uses}}
{{:ChatGPT Uses}}
==Updates==
===August 14, 2023===
[[Azure ChatGPT]]
===January 30, 2023===
====Official release note====
We’ve upgraded the ChatGPT model with improved factuality and mathematical capabilities.
====Comments and discoveries====
While ChatGPT became more factual and better at math, before each user entered [[prompts]], [[OpenAI]] asked ChatGPT to be as concise as possible. Proof before the ability to display the previous prompt was disabled: [https://preview.redd.it/0h0kop4ja4ga1.jpeg?width=1080&format=pjpg&auto=webp&v=enabled&s=4b458b2fe34d78043ba12a7ef7cf78f2fc589d5e proof 1], [https://preview.redd.it/uam77takn4ga1.png?width=556&format=png&auto=webp&v=enabled&s=0594837709cd8fd8f8a2b10ec84af974a47f6c0b proof 2].
===January 9, 2023===
====Official release note====
We're excited to announce several updates to ChatGPT! Here's what's new:
#We made more improvements to the ChatGPT model! It should be generally better across a wide range of topics and has improved factuality.
#Stop generating: Based on your feedback, we've added the ability to stop generating ChatGPT's response
====Comments and discoveries====
===December 15, 2023===
====Official release note====
We're excited to announce several updates to ChatGPT! Here's what's new:
#General performance: Among other improvements, users will notice that ChatGPT is now less likely to refuse to answer questions.
#Conversation history: You’ll soon be able to view past conversations with ChatGPT, rename your saved conversations and delete the ones you don’t want to keep. We are gradually rolling out this feature.
#Daily limit: To ensure a high-quality experience for all ChatGPT users, we are experimenting with a daily message cap. If you’re included in this group, you’ll be presented with an option to extend your access by providing feedback to ChatGPT.
To see if you’re using the updated version, look for “ChatGPT Dec 15 Version” at the bottom of the screen.
====Comments and discoveries====
==SEO==
{{see also|SEO for ChatGPT}}


==ChatGPT Alternatives==
==ChatGPT Alternatives==