LLM Agent 완전 정복: 방법론, 응용, 도전과제 종합 Survey

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Mar 27, 2025•Junyu Luo, Weizhi Zhang, Ye Yuan +23•View PDF

TL;DR Highlight

LLM 에이전트의 구축-협업-진화 3축 프레임워크로 최신 연구를 총정리한 종합 서베이

Who Should Read

LLM 에이전트 시스템을 직접 구축하거나 도입을 검토 중인 AI 엔지니어 및 아키텍트. Multi-agent 시스템 설계 패턴, 보안 위협, 실제 적용 사례를 한 번에 파악하고 싶은 개발자.

Core Mechanics

에이전트 구조를 Profile 정의 → Memory → Planning → Action 실행 4단계로 분해. 각 레이어가 순환 최적화 루프를 형성함
멀티에이전트 협업 패턴은 크게 3가지: 중앙 컨트롤러가 지시하는 Centralized, 에이전트끼리 직접 소통하는 Decentralized, 둘을 섞은 Hybrid
에이전트 진화 방식도 3가지: 스스로 학습하는 Self-Learning(SELF-REFINE, STaR), 여러 에이전트가 함께 진화하는 Co-Evolution, 외부 지식으로 강화하는 External Resource
보안 위협은 에이전트 모델 자체를 노리는 공격(Backdoor, Jailbreak, Adversarial)과 입력 데이터를 오염시키는 공격(Prompt Injection, RAG Poisoning)으로 분류됨
실제 적용 분야: 화학 합성 자동화(ChemCrow), 합금 설계(AtomAgents), 가상 병원 시뮬레이션(AgentHospital), 소프트웨어 개발(MetaGPT, ChatDev) 등
MCP(Model Context Protocol)가 LLM과 외부 데이터 소스를 안전하게 연결하는 표준 프로토콜로 등장, 에이전트 배포 인프라의 핵심이 되는 중

Evidence

AgentBench는 8개 인터랙티브 환경에서 에이전트를 테스트, Mind2Web은 31개 도메인 137개 실제 웹사이트 기반 평가
AgentHarm 벤치마크는 11개 위험 카테고리에서 440개 악성 에이전트 태스크를 수집해 LLM 악용 위험을 최초 체계적 평가
OSWorld는 Ubuntu/Windows/macOS에서 369개 멀티앱 태스크를 지원하는 실제 컴퓨터 생태계 벤치마크 구축
Agent Security Bench는 10개 시나리오, 10개 에이전트, 400개 이상 도구, 23개 공격/방어 방법, 8개 지표로 LLM 에이전트 취약점 평가

How to Apply

새 에이전트 시스템을 설계할 때 이 논문의 4-pillar 체크리스트(Profile/Memory/Planning/Action)를 템플릿으로 활용. 각 컴포넌트별로 어떤 구현체(ReAct, MemGPT, ToT 등)를 쓸지 선택하면 설계 결정이 구조화됨
RAG 기반 에이전트를 운영 중이라면 External Source Poisoning 위협(AgentPoison, WIPI 등)을 반드시 고려. RTBAS나 TaskShield처럼 정보 흐름 매 단계를 검증하는 방어 레이어 추가를 검토할 것
멀티에이전트 파이프라인 구축 시 협업 패턴을 먼저 선택: 엄격한 워크플로우면 Centralized(MetaGPT 패턴), 창의적 토론이 필요하면 Decentralized Debate(MAD 패턴), 복잡도 기반 라우팅이 필요하면 Hybrid(MDAgents 패턴) 적용

Code Example

snippet

# LLM 에이전트 4-pillar 설계 체크리스트 템플릿
agent_design = {
    # 1. Profile Definition
    "profile": {
        "type": "static",  # or 'dynamic'
        "role": "software engineer",
        "system_prompt": "You are an expert Python developer..."
    },
    
    # 2. Memory Mechanism
    "memory": {
        "short_term": "conversation history (last N turns)",
        "long_term": "vector DB (e.g., ChromaDB, Pinecone)",
        "retrieval": "RAG with top-k=5"
    },
    
    # 3. Planning Capability
    "planning": {
        "strategy": "ReAct",  # or 'ToT', 'Plan-and-Solve'
        "decomposition": "single-path chain",  # or 'tree'
        "feedback": ["environment", "self-reflection"]
    },
    
    # 4. Action Execution
    "action": {
        "tools": ["search", "code_interpreter", "calculator"],
        "tool_selection": "LLM-based",
        "physical": False  # True for embodied agents
    }
}

# Centralized vs Decentralized 선택 가이드
collaboration_patterns = {
    "strict_workflow": "Centralized (MetaGPT 패턴) - 역할 분담 명확",
    "creative_debate": "Decentralized (MAD 패턴) - 여러 에이전트 토론",
    "adaptive": "Hybrid (MDAgents) - 복잡도에 따라 자동 라우팅"
}

Terminology

ReAct생각(Reasoning)과 행동(Acting)을 번갈아 반복하는 에이전트 패턴. 사람이 생각하고 행동하고 결과를 보고 다시 생각하는 과정을 흉내냄.

RAGRetrieval-Augmented Generation. LLM이 답할 때 외부 문서를 먼저 검색해서 참고하게 하는 기법. 오래된 지식 문제를 완화함.

ToTTree of Thoughts. 문제 풀이를 나무 구조로 탐색해 여러 경로를 시도하고 막히면 되돌아올 수 있게 하는 계획 기법.

Multi-agent여러 LLM 에이전트가 협력하거나 경쟁하며 문제를 푸는 시스템. 한 명이 혼자 하는 게 아니라 팀처럼 역할을 나눔.

JailbreakLLM의 안전 장치를 우회해서 금지된 응답을 유도하는 공격. 모델이 하면 안 된다고 배운 것을 억지로 하게 만드는 것.

Backdoor Attack특정 트리거(키워드나 패턴)를 만나면 이상한 동작을 하도록 모델을 오염시키는 공격. 평소엔 정상인데 특정 암호어에만 반응함.

MCPModel Context Protocol. LLM이 외부 데이터 소스나 도구와 안전하게 연결되는 표준 규격. USB처럼 어떤 LLM이든 MCP 지원 서비스에 꽂으면 바로 쓸 수 있게 함.

Prompt Injection악의적인 텍스트를 입력이나 외부 소스에 숨겨 에이전트가 원래 의도와 다른 행동을 하도록 유도하는 공격.

Related Resources

Original Abstract (Expand)

The era of intelligent agents is upon us, driven by revolutionary advancements in large language models. Large Language Model (LLM) agents, with goal-driven behaviors and dynamic adaptation capabilities, potentially represent a critical pathway toward artificial general intelligence. This survey systematically deconstructs LLM agent systems through a methodology-centered taxonomy, linking architectural foundations, collaboration mechanisms, and evolutionary pathways. We unify fragmented research threads by revealing fundamental connections between agent design principles and their emergent behaviors in complex environments. Our work provides a unified architectural perspective, examining how agents are constructed, how they collaborate, and how they evolve over time, while also addressing evaluation methodologies, tool applications, practical challenges, and diverse application domains. By surveying the latest developments in this rapidly evolving field, we offer researchers a structured taxonomy for understanding LLM agents and identify promising directions for future research. The collection is available at https://github.com/luo-junyu/Awesome-Agent-Papers.