
To-Do List: Difference between revisions



'''[[RLHF]]''' - [[Reinforcement Learning from Human Feedback]] - https://huggingface.co/blog/rlhf


'''[[Transformer]]''' - https://news.ycombinator.com/item?id=34566275