description | cover | coverY | layout | |||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NLP & LLM & RAG & GenAI & Agent 에 대한 최신 논문 동향을 파악하고 리스트를 정리해보자! |
0 |
|
- Attention is all you need
- CoVe : Chain of Verifiction Reduces Hallucination in Large Language Models
- RAG Survey : A Survey on Retrieval-Augmented Text Generation for Large Language Models
- Interleaving Retrieval with Chain-of-Thought for Knowledge-Intensive Multi-Step Questions
- Taka a Step Back : Evoking Reasoning via Abstraction in Large Language Models
- EEVE : Efficient and Effective Vocabulary Expansion Towards Mulitilingual Large Language Models
- Will GPT-4 Run Doom?
- PiSSA : Principal Singular Values and Singular Vectors Adaptation of Large Language Models
- RAFT : Adapting Language Model to Domain Specific RAG
- ReALM : Reference Resolution As Language Modeling
- WavLLM : Towards Robust and Adaptive Speech Large Language Model
- Batch Calibration : Rethinking Calibration for In-Context Learning and Prompt Engineering
- Tokenizer Choice For LLM Training: Negligible or Crucial?
- AIOS : LLM Agent Operating System
- Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge
- FollowIR : Evaluating and Teaching Information Retrieval Models to Follow Instructions
- Stealing Part of a Production Language Model
- The Claude 3 Model Family: Opus, Sonnet, Haiku
- Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
- Can Large Language Models Reason and Plan?
- ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
- Retrieval-Augmented Generation for AI-Generated Content: A Survey
- KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
- Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
- SaulLM-7B: A pioneering Large Language Model for Law
- Design2Code: How Far Are We From Automating Front-End Engineering?
- TripoSR: Fast 3D Object Reconstruction from a Single Image
- MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
- MoE : Towards Understanding Mixture of Experts in Deep Learning
- SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures
- CLAM: Selective Clarification for Ambiguous Questions with Generative Language Models