Large Language Models
I plan to use this repository as a centralized hub for all the activities I intend to undertake concerning Large Language Models (LLM). Specifically, I intend to share the following items here:
- Significant papers related to LLMs
- Open-source datasets
- Open-source models
- Important projects
- Jupyter notebooks
It has been claimed that LLMs bring us very close to achieving General Artificial Intelligence (GAI) – potentially on par with or even superior to human intelligence! While language is indeed a vital tool in our cognitive toolbox, verbal and textual communication does not fully encompass the breadth of our communicative capabilities. Moreover, the language we employ in communication is fraught with subjectivity, variability, and arbitrariness.
There are assertions that when LLMs are trained on extensive datasets, they begin to exhibit surprising behaviors, analogous to phase transitions in physics! This is an extraordinary proposition! Essentially, this implies the potential for LLMs to develop consciousness, a characteristic traditionally considered emblematic of human identity.
I believe this concern has been somewhat exaggerated. Claims about the consequences of LLMs range from being as disruptive as nuclear wars to pandemics, alien attacks, or asteroid impacts!
This aspect is perhaps the easiest to comprehend and holds considerable truth. However, it is not dissimilar from the effects of past inventions such as the printing press, camera, or calculator. As people become increasingly reliant on tools like LLMs, certain abilities might diminish, much like how widespread use of mobile navigation systems diminished the importance of having a strong sense of direction.
- [https://arxiv.org/abs/1409.0473] (Neural Machine Translation by Jointly Learning to Align and Translate)
- [https://arxiv.org/abs/1706.03762] (Attention Is All You Need)
- [https://arxiv.org/abs/1810.04805] (BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding)
<<<<<<< HEAD
- [https://arxiv.org/abs/2307.03109](A Survey on Evaluation of Large Language Models)
- [https://arxiv.org/abs/2112.04359](Ethical and social risks of harm from Language Models)
- [https://arxiv.org/abs/2108.07258](On the Opportunities and Risks of Foundation Models)
- [https://arxiv.org/abs/2206.07682](Emergent Abilities of Large Language Models)
- [https://arxiv.org/abs/2303.18223](A Survey of Large Language Models)
- [https://arxiv.org/abs/2309.01029](Explainability for Large Language Models: A Survey)
- [https://arxiv.org/abs/2303.12712](Sparks of Arti cial General Intelligence:Early experiments with GPT-4)
- [https://arxiv.org/abs/2311.07361](The Impact of Large Language Models on Scientific Discovery:a Preliminary Study using GPT-4)