
5个多模态大模型研究方向
4个多模态大模型关键技术
TOP28多模态大模型(源码)
大模型Agent与RLHF论文
两篇多模态大模型综述论文
├───4个多模态大模型关键技术│ ├───LLM辅助视觉推理│ │ Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation.pdf│ │ AssistGPT A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn.pdf│ │ Caption Anything Interactive Image Description with Diverse Multimodal Controls.pdf│ │ Chameleon Plug-and-Play Compositional Reasoning with Large Language Models.pdf│ │ ChatGPT Asks BLIP-2 Answers Automatic Questioning Towards Enriched Visual Descriptions.pdf│ │ GPT4Tools Teaching Large Language Model to Use Tools via Self-instruction.pdf│ │ HuggingGPT Solving AI Tasks with ChatGPT and its Friends in HuggingFace.pdf│ │ IdealGPT Iteratively Decomposing Vision and Language Reasoning via Large Language Models.pdf│ │ LayoutGPT Compositional Visual Planning and Generation with Large Language Models.pdf│ │ Mindstorms in Natural Language-Based Societies of Mind.pdf│ │ MM-REACT Prompting ChatGPT for Multimodal Reasoning and Action.pdf│ │ PointCLIP V2 Adapting CLIP for Powerful 3D Open-world Learning.pdf│ │ Prompt, Generate, then Cache Cascade of Foundation Models makes Strong Few-shot Learners.pdf│ │ Retrieving-to-Answer Zero-Shot Video Question Answering with Frozen Large Language Models.pdf│ │ Socratic Models Composing Zero-Shot Multimodal Reasoning with Language.pdf│ │ SuS-X Training-Free Name-Only Transfer of Vision-Language Models.pdf│ │ ViperGPT Visual Inference via Python Execution for Reasoning.pdf│ │ Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf│ │ Visual Programming Compositional visual reasoning without training.pdf│ │ │ ├───多模态上下文学习│ │ HowToCaption Prompting LLMs to Transform Video Annotations at Scale.pdf│ │ Language as the Medium Multimodal Video Classification through text only.pdf│ │ Large Language Models are Visual Reasoning Coordinators.pdf│ │ Lightweight In-Context Tuning for Multimodal Unified Models.pdf│ │ Link-Context Learning for Multimodal LLMs.pdf│ │ MMHQA-ICL Multimodal In-context Learning for Hybrid Question Answering over Text, Tables and Images.pdf│ │ Multimodal Foundation Models For Echocardiogram Interpretation.pdf│ │ Proactive Human-Robot Interaction using Visuo-Lingual Transformers.pdf│ │ Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition.pdf│ │ │ ├───多模态思维链│ │ Caption Anything Interactive Image Description with Diverse Multimodal Controls.pdf│ │ Chain of Thought Prompt Tuning in Vision Language Models.pdf│ │ Chameleon Plug-and-Play Compositional Reasoning with Large Language Models.pdf│ │ EmbodiedGPT Vision-Language Pre-Training via Embodied Chain of Thought.pdf│ │ Explainable Multimodal Emotion Reasoning.pdf│ │ Learn to Explain Multimodal Reasoning via Thought Chains for Science Question Answering.pdf│ │ Let’s Think Frame by Frame Evaluating Video Chain of Thought with Video Infilling and Prediction.pdf│ │ MM-REACT Prompting ChatGPT for Multimodal Reasoning and Action.pdf│ │ Multimodal Chain-of-Thought Reasoning in Language Models.pdf│ │ Visual Chain of Thought Bridging Logical Gaps with Multimodal Infillings.pdf│ │ Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf│ │ Visual Programming Compositional visual reasoning without training.pdf│ │ │ └───多模态指令微调│ Aligning Large Multi-Modal Model with Robust Instruction Tuning.pdf│ ChatBridge Bridging Modalities with Large Language Model as a Language Catalyst.pdf│ Cheap and Quick Efficient Vision-Language Instruction Tuning for Large Language Models.pdf│ DetGPT Detect What You Need via Reasoning.pdf│ GPT4Tools Teaching Large Language Model to Use Tools via Self-instruction.pdf│ InstructBLIP Towards General-purpose Vision-Language Models with Instruction Tuning.pdf│ LAMM Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark.pdf│ Listen, Think, and Understand.pdf│ LLaMA-Adapter Efficient Fine-tuning of Language Models with Zero-init Attention.pdf│ LLaMA-Adapter V2 Parameter-Efficient Visual Instruction Model.pdf│ LLaVA-Med Training a Large Language-and-Vision Assistant for Biomedicine in One Day.pdf│ LLaVAR Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.pdf│ LMEye An Interactive Perception Network for Large Language Models.pdf│ M3IT A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.pdf│ Macaw-LLM Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration.pdf│ MIMIC-IT Multi-Modal In-Context Instruction Tuning.pdf│ MiniGPT-4 Enhancing Vision-Language Understanding with Advanced Large Language Models.pdf│ mPLUG-Owl Modularization Empowers Large Language Models with Multimodality.pdf│ MultiInstruct Improving Multi-Modal Zero-Shot Learning via Instruction Tuning.pdf│ MultiModal-GPT A Vision and Language Model for Dialogue with Humans.pdf│ PandaGPT One Model To Instruction-Follow Them All.pdf│ PMC-VQA Visual Instruction Tuning for Medical Visual Question Answering.pdf│ Shikra Unleashing Multimodal LLM's Referential Dialogue Magic.pdf│ Video-ChatGPT Towards Detailed Video Understanding via Large Vision and Language Models.pdf│ Video-LLaMA An Instruction-tuned Audio-Visual Language Model for Video Understanding.pdf│ VideoChat Chat-Centric Video Understanding.pdf│ VisionLLM Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.pdf│ Visual Instruction Tuning with Polite Flamingo.pdf│ Visual Instruction Tuning.pdf│ X-LLM Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.pdf│ ├───5个多模态大模型研究方向│ ├───LLM加持的多模态大模型│ │ Contextual Object Detection with Multimodal Large Language Models.pdf│ │ Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models.pdf│ │ MM-Vet Evaluating Large Multimodal Models.pdf│ │ MME A Comprehensive Evaluation Benchmark for Multimodal Large Language Models.pdf│ │ SCITUNE Aligning Large Language Models with Scientific Multimodal.pdf│ │ X-LLM Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.pdf│ │ │ ├───多模态agent│ │ A Contextualized Real-Time Multimodal Emotion Recognition for Conversational Agents using Graph Convolutional Networks in Reinforcement Learning.pdf│ │ Clinically-Inspired Multi-Agent Transformers for Disease Trajectory Forecasting from Multimodal Data.pdf│ │ Guide Your Agent with Adaptive Multimodal Rewards.pdf│ │ Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback.pdf│ │ Instruction-Following Agents with Multimodal Transformer.pdf│ │ Multimodal Speech Recognition for Language-Guided Embodied Agents.pdf│ │ SPRING Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph.pdf│ │ The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents.pdf│ │ You Only Look at Screens Multimodal Chain-of-Action Agents.pdf│ │ │ ├───统一视觉模型│ │ BLIP Bootstrapping Language-Image Pre-training for.pdf│ │ Pro-tuning Unified Prompt Tuning for Vision Tasks.pdf│ │ UNIFIED VISION AND LANGUAGE PROMPT LEARNING.pdf│ │ Unified Vision-Language Pre-Training for Image Captioning and VQA.pdf│ │ VLMO Unified Vision-Language Pre-Training with.pdf│ │ You Need Multiple Exiting Dynamic Early Exiting for.pdf│ │ │ ├───视觉理解│ │ Cream Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.pdf│ │ DocFormerv2 Local Features for Document Understanding.pdf│ │ LLaVAR Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.pdf│ │ M3IT A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.pdf│ │ mPLUG-DocOwl Modularized Multimodal Large Language Model for Document Understanding.pdf│ │ Multimodal Transformer for Multimodal Machine Translation.pdf│ │ On the Performance of Multimodal Language Models.pdf│ │ PDFVQA A New Dataset for Real-World VQA on PDF Documents.pdf│ │ TouchStone Evaluating Vision-Language Models by Language Models.pdf│ │ UReader Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.pdf│ │ │ └───视觉生成│ Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos.pdf│ Enabling Robots to Draw and Tell Towards Visually Grounded Multimodal Description Generation.pdf│ Generation of Multimodal Justification Using Visual Word Constraint Model for Explainable Computer-Aided Diagnosis.pdf│ KM-BART Knowledge Enhanced Multimodal BART for Visual Commonsense Generation.pdf│ Multimodal Differential Network for Visual Question Generation.pdf│ Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation.pdf│ Multimodal Prompt Retrieval for Generative Visual Question Answering.pdf│ Opal Multimodal Image Generation for News Illustration.pdf│ TextPainter Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design.pdf│ ├───TOP28多模态大模型(源码)│ ├───BLIP-2│ │ LAVIS-main.zip│ │ │ ├───BLIVA│ │ BLIVA-main.zip│ │ │ ├───Cheetor│ │ Cheetah-main.zip│ │ │ ├───GIT2│ │ GenerativeImage2Text-main.zip│ │ │ ├───GPT-4V│ │ GPTV_System_Card.pdf│ │ │ ├───ImageBind_LLM│ │ ImageBind_LLM.zip│ │ │ ├───InstructBLIP│ │ LAVIS-main.zip│ │ │ ├───InternLM-XComposer│ │ InternLM-XComposer-main.zip│ │ │ ├───LaVIN│ │ LaVIN-main.zip│ │ │ ├───Lion│ │ Lion-main.zip│ │ │ ├───LLaMA-Adapter V2│ │ LLaMA-Adapter-main.zip│ │ │ ├───LLaVA│ │ LLaVA-main.zip│ │ │ ├───LRV-Instruction│ │ LRV-Instruction-main.zip│ │ │ ├───Lynx│ │ lynx-llm-main.zip│ │ │ ├───MiniGPT-4│ │ MiniGPT-4-main.zip│ │ │ ├───MMICL│ │ MIC-master.zip│ │ │ ├───mPLUG-Owl│ │ mPLUG-Owl-main.zip│ │ │ ├───Muffin│ │ Muffin-main.zip│ │ │ ├───Multimodal-GPT│ │ Multimodal-GPT-main.zip│ │ │ ├───Octopus│ │ UnifiedMultimodalInstructionTuning-main.zip│ │ │ ├───Otter│ │ Otter-main.zip│ │ │ ├───PandaGPT│ │ PandaGPT-main.zip│ │ │ ├───Qwen-VL-Chat│ │ Qwen-VL-master.zip│ │ │ ├───Skywork-MM│ │ Skywork-MM-main.zip│ │ │ ├───SPHINX│ │ LLaMA2-Accessory-main.zip│ │ │ ├───VisualGLM-6B│ │ VisualGLM-6B-main.zip│ │ │ ├───VPGTrans│ │ VPGTrans-main.zip│ │ │ └───WeMM│ WeMM-main.zip│ ├───两篇多模态大模型综述论文│ 微软最全综述:Multimodal Foundation Models From Specialists to General-Purpose Assistants.pdf│ 首篇综述:A Survey on Multimodal Large Language Models.pdf│ └───大模型Agent与RLHF论文 ├───大模型Agent论文合集 │ │ A Language-Agent Approach to Formal Theorem-Proving.pdf │ │ A real-world webagent with planning, long context understanding, and program synthesis.pdf │ │ Adapting LLM Agents Through Communication.pdf │ │ Agent Instructs Large Language Models to be General Zero-Shot Reasoners.pdf │ │ AgentTuning Enabling Generalized Agent Abilities for LLMs.pdf │ │ All in One Multi-task Prompting for Graph Neural Networks.pdf │ │ Ambient Adventures Teaching ChatGPT on Developing Complex Stories.pdf │ │ An Embodied Generalist Agent in 3D World.pdf │ │ AudioLDM 2 Learning Holistic Audio Generation with Self-supervised Pretraining.pdf │ │ Augmenting Language Models with Long-Term Memory.pdf │ │ Auto-GPT for Online Decision Making Benchmarks and Additional Opinions.pdf │ │ AutoAgents A Framework for Automatic Agent Generation.pdf │ │ Benchmarking Large Language Models as AI Research Agents.pdf │ │ Chain of hindsight aligns language models with feedback.pdf │ │ Chain-of-thought prompting elicits reasoning in large language models.pdf │ │ CHATANYTHING FACETIME CHAT WITH LLM-ENHANCED PERSONAS.pdf │ │ ChatMOF An Autonomous AI System for Predicting and Generating Metal-Organic Frameworks.pdf │ │ CLIN A Continually Learning Language Agent for Rapid Task Adaptation and Generalization.pdf │ │ Code Llama Open Foundation Models for Code.pdf │ │ Communicative agents for software development.pdf │ │ Consciousness in Artificial Intelligence Insights from the Science of Consciousness.pdf │ │ Cumulative Reasoning With Large Language Models.pdf │ │ Deception Abilities Emerged in Large Language Models.pdf │ │ Diversifying AI Towards Creative Chess with AlphaZero.pdf │ │ Does Role-Playing Chatbots Capture the Character Personalities Assessing Personality Traits for Role-Playing Chatbots.pdf │ │ Dynamic LLM-Agent Network An LLM-agent Collaboration Framework with Agent Team Optimization.pdf │ │ Evaluating Large Language Models at Evaluating Instruction Following.pdf │ │ Exploring Large Language Models for Communication Games An Empirical Study on Werewolf.pdf │ │ Few-shot learning with retrieval augmented language models.pdf │ │ Formally Specifying the High-Level Behavior of LLM-Based Agents.pdf │ │ Generative agents Interactive simulacra of human behavior.pdf │ │ Gorilla Large language model connected with massive apis.pdf │ │ InstructionGPT-4 A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4.pdf │ │ Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models.pdf │ │ Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale.pdf │ │ Large Language Models for Information Retrieval A Survey.pdf │ │ Learning to Identify Critical States for Reinforcement Learning from Videos.pdf │ │ Learning to Reason and Memorize with Self-Notes.pdf │ │ Lemur Harmonizing Natural Language and Code for Language Agents.pdf │ │ LLM-Deliberation Evaluating LLMs with Interactive Multi-Agent Negotiation Game.pdf │ │ Memory augmented large language models are computationally universal.pdf │ │ Memory Sandbox Transparent and Interactive Memory Management for Conversational Agents.pdf │ │ Mind the Gap Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes.pdf │ │ Multimodal Web Navigation with Instruction-Finetuned Foundation Models.pdf │ │ OKR-Agent An Object and Key Results Driven Agent System with Hierarchical Self-Collaboration and Self-Evaluation.pdf │ │ Pal Program-aided language models.pdf │ │ Put Your Money Where Your Mouth Is Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena.pdf │ │ Quantifying the Impact of Large Language Models on Collective Opinion Dynamics.pdf │ │ React Synergizing reasoning and acting in language models.pdf │ │ Reinforcement Learning for Generative AI A Survey.pdf │ │ Retroformer Retrospective Large Language Agents with Policy Gradient Optimization.pdf │ │ REX Rapid Exploration and eXploitation for AI agents.pdf │ │ ROLELLM BENCHMARKING, ELICITING, AND ENHANCING ROLE-PLAYING ABILITIES OF LARGE LANGUAGE MODELS.pdf │ │ SAPIEN Affective Virtual Agents Powered by Large Language Models.pdf │ │ SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.pdf │ │ Self-Alignment with Instruction Backtranslation.pdf │ │ Steve-Eye Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.pdf │ │ Toolformer Language models can teach themselves to use tools.pdf │ │ ToolLLM Facilitating large language models to master 16000+ real-world apis.pdf │ │ Towards a unified agent with foundation models.pdf │ │ Towards More Human-Like AI Communication.pdf │ │ TPTU Task Planning and Tool Usage of Large Language Model-based AI Agents.pdf │ │ Trustworthy LLMs a Survey and Guideline for Evaluating Large Language Models' Alignment.pdf │ │ Voyager An open-ended embodied agent with large language models.pdf │ │ You Only Look at Screens Multimodal Chain-of-Action Agents.pdf │ │ │ ├───2篇综述 │ │ A Survey on Large Language Model-based Autonomous Agents.pdf │ │ The Rise and Potential of Large Language ModelBased Agents A Survey.pdf │ │ │ ├───EMNLP2023 LLM Agent │ │ A Zero-Shot Language Agent for Computer Control with Structured Reflection.pdf │ │ AgentSims An Open-Source Sandbox for Large Language Model Evaluation.pdf │ │ Answering Questions by Meta-Reasoning over Multiple Chains of Thought.pdf │ │ API-Bank A Comprehensive Benchmark for Tool-Augmented LLMs.pdf │ │ AutoTrial Prompting Language Models for Clinical Trial Design.pdf │ │ Character-LLM A Trainable Agent for Role-Playing.pdf │ │ ChatCoT Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models.pdf │ │ CRYSTAL Introspective Reasoners Reinforced with Self-Feedback.pdf │ │ Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents.pdf │ │ Examining Inter-Consistency of Large Language Models Collaboration An In-depth Analysis via Debate.pdf │ │ Humanoid Agents Platform for Simulating Human-like Generative Agents.pdf │ │ Logic-LM Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning.pdf │ │ MoT Memory-of-Thought Enables ChatGPT to Self-Improve.pdf │ │ Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning.pdf │ │ Reasoning with Language Model is Planning with World Model.pdf │ │ SelfCheckGPT Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models.pdf │ │ The CoT Collection Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning.pdf │ │ Theory of Mind for Multi-Agent Collaboration via Large Language Models.pdf │ │ │ ├───ICLR2024 LLM Agent │ │ AgentBench Evaluating LLMs as Agents.pdf │ │ AgentVerse Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors.pdf │ │ Avalon's Game of Thoughts Battle Against Deception through Recursive Contemplation.pdf │ │ Building Cooperative Embodied Agents Modularly with Large Language Models.pdf │ │ Evaluating Multi-Agent Coordination Abilities in Large Language Models.pdf │ │ Exploring Collaboration Mechanisms for LLM Agents A Social Psychology View.pdf │ │ Identifying the Risks of LM Agents with an LM-Emulated Sandbox.pdf │ │ Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game.pdf │ │ Lyfe Agents generative agents for low-cost real-time social interaction.pdf │ │ MindAgent Emergent Gaming Interaction.pdf │ │ Playing repeated games with Large Language Models.pdf │ │ SmartPlay A Benchmark for LLMs as Intelligent Agents.pdf │ │ SOTOPIA Interactive Evaluation for Social Intelligence in Language Agents.pdf │ │ WebArena A Realistic Web Environment for Building Autonomous Agents.pdf │ │ Welfare Diplomacy Benchmarking Language Model Cooperation.pdf │ │ │ ├───LLM-based Agent应用 │ │ 3D-LLM Injecting the 3D World into Large Language Models.pdf │ │ Agents An Open-source Framework for Autonomous Language Agents.pdf │ │ CGMI Configurable General Multi-Agent Interaction Framework.pdf │ │ ChatEval Towards Better LLM-based Evaluators through Multi-Agent Debate.pdf │ │ ChatLLM Network More brains, More intelligence.pdf │ │ ChatMOF An Autonomous AI System for Predicting and Generating Metal-Organic Frameworks.pdf │ │ Do Embodied Agents Dream of Pixelated Sheep Embodied Decision Making using Language Guided World Modelling.pdf │ │ Improving Factuality and Reasoning in Language Models through Multiagent Debate.pdf │ │ Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback.pdf │ │ InterAct Exploring the Potentials of ChatGPT as a Cooperative Agent.pdf │ │ Language Models as Zero-Shot Planners Extracting Actionable Knowledge for Embodied Agents.pdf │ │ MetaGPT Meta Programming for A Multi-Agent Collaborative Framework.pdf │ │ Multi-Agent Collaboration Harnessing the Power of Intelligent LLM Agents.pdf │ │ Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents.pdf │ │ ProAgent Building Proactive Cooperative AI with Large Language Models.pdf │ │ RoCo Dialectic Multi-Robot Collaboration with Large Language Models.pdf │ │ ScienceWorld Is your Agent Smarter than a 5th Grader.pdf │ │ SheetCopilot Bringing Software Productivity to the Next Level through Large Language Models.pdf │ │ Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks.pdf │ │ SwiftSage A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.pdf │ │ The Hitchhiker's Guide to Program Analysis A Journey with Large Language Models.pdf │ │ WebGPT Browser-assisted question-answering with human feedback.pdf │ │ Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models.pdf │ │ │ ├───LLM-based Agent构建 │ │ A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity.pdf │ │ Agent Instructs Large Language Models to be General Zero-Shot Reasoners.pdf │ │ Agents An Open-source Framework for Autonomous Language Agents.pdf │ │ AudioGPT Understanding and Generating Speech, Music, Sound, and Talking Head.pdf │ │ AutoGen Enabling Next-Gen LLM Applications via Multi-Agent Conversation.pdf │ │ AVIS Autonomous Visual Information Seeking with Large Language Model Agent.pdf │ │ CAMEL Communicative Agents for “Mind” Exploration of Large Scale Language Model Society..pdf │ │ Clever Hans or Neural Theory of Mind Stress Testing Social Reasoning in Large Language Models.pdf │ │ HuggingGPT Solving AI Tasks with ChatGPT and its Friends in Hugging Face.pdf │ │ InstructBLIP Towards General-purpose Vision-Language Models with Instruction Tuning.pdf │ │ InternGPT Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language.pdf │ │ Large Language Models as Tool Makers.pdf │ │ Learning Distributed Representations of Sentences from Unlabelled Data.pdf │ │ LLM+P Empowering Large Language Models with Optimal Planning Proficiency.pdf │ │ MiniGPT-4 Enhancing Vision-Language Understanding with Advanced Large Language Models.pdf │ │ PandaGPT One Model To Instruction-Follow Them All.pdf │ │ Reflexion language agents with verbal reinforcement learning..pdf │ │ SwiftSage A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.pdf │ │ Think-on-Graph Deep and Responsible Reasoning of Large Language Model on Knowledge Graph.pdf │ │ Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model Explorations with GPT4-Vision and Beyond.pdf │ │ Tree of Thoughts Deliberate Problem Solving with Large Language Models..pdf │ │ Visual Instruction Tuning.pdf │ │ │ ├───LLM-based Agent评估 │ │ Evaluating Cognitive Maps and Planning in Large Language Models with CogEval.pdf │ │ On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark).pdf │ │ │ └───NeurIPS2023 LLM Agent │ Describe, Explain, Plan and Select Interactive Planning with LLMs Enables Open-World Multi-Task Agents.pdf │ GPT4Tools Teaching Large Language Model to Use Tools via Self-instruction.pdf │ Large Language Models Are Semi-Parametric Reinforcement Learning Agents.pdf │ Large Language Models as Commonsense Knowledge for Large-Scale Task Planning.pdf │ Large Language Models can Implement Policy Iteration.pdf │ Large Language Models of Code Fail at Completing Code with Potential Bugs.pdf │ Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning.pdf │ LLMs for Semi-Automated Data Science Introducing CAAFE for Context-Aware Automated Feature Engineering.pdf │ Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples.pdf │ └───大模型RLHF论文合集 Aligning Language Models with Preferences through f-divergence Minimization.pdf Better Aligning Text-to-Image Models with Human Preference.pdf Constitutional AI Harmlessness from AI Feedback.pdf Deep Reinforcement Learning from Human Preferences.pdf Deep TAMER Interactive Agent Shaping in High-Dimensional State Spaces.pdf Discovering Language Model Behaviors with Model-Written Evaluations.pdf Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning.pdf Few-shot Preference Learning for Human-in-the-Loop RL.pdf Fine-Tuning Language Models from Human Preferences.pdf GPT-4 Technical Report.pdf Improving alignment of dialogue agents via targeted human judgements.pdf InstructGPT Training language models to follow instructions with human feedback.pdf Interactive Learning from Policy-Dependent Human Feedback.pdf Is Reinforcement Learning (Not) for Natural Language Processing Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.pdf Learning to summarize from human feedback.pdf Learning to summarize with human feedback.pdf Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning.pdf Pretraining Language Models with Human Preferences.pdf Quark Controllable Text Generation with Reinforced Unlearning.pdf Recursively Summarizing Books with Human Feedback.pdf Red Teaming Language Models to Reduce Harms Methods, Scaling Behaviors, and Lessons Learned.pdf Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation.pdf Reward learning from human preferences and demonstrations in Atari.pdf Scalable agent alignment via reward modeling a research direction.pdf Scaling Laws for Reward Model Overoptimization.pdf Teaching language models to support answers with verified quotes.pdf Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.pdf Training language models to follow instructions with human feedback.pdf Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf WebGPT Browser-assisted question-answering with human feedback.pdf
声明:本站所有文章,如无特殊说明或标注,均为本站原创发布。任何个人或组织,在未征得本站同意时,禁止复制、盗用、采集、发布本站内容到任何网站、书籍等各类媒体平台。如若本站内容侵犯了原著者的合法权益,可联系我们进行处理。
