Cortiq

· 63편

Agentic AI

해당 날짜의 arXiv 발표에서 선별한 랭킹 브리프입니다. Cortiq은 주제 적합도, 주저자 맥락, 공개 연구 신호를 함께 봅니다.

1. Less Context, Better Agents: Efficient Context Engineering for Long-Horizon Tool-Using LLM Agents

Abhilasha Lodha, Mahsa Pahlavikhah Varnosfaderani, Abir Chakraborty, Abhinav Mithal

주저자 소속 - Stanford University

2. HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning

Juncheng Diao, Zhicong Lu, Peiguang Li, Yongwei Zhou, Changyuan Tian, Qingbin Li, Rongxiang Weng, Jingang Wang, Xunliang Cai

주저자 소속 - Meituan

3. Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation

Yupu Hao, Zhuoran Jin, Huanxuan Liao, Kang Liu, Jun Zhao

주저자 소속 - The Key Laboratory of Cognition and Decision Intelligence, School of Computer Science and Technology, Shandong University, China

4. Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

Yuchen Ling, Shengcheng Yu, Zhenyu Chen, Chunrong Fang

주저자 소속 - State Key Laboratory for Novel Software Technology, Nanjing University

5. Assessing Automated Prompt Injection Attacks in Agentic Environments

David Hofer, Edoardo Debenedetti, Florian Tramèr

주저자 소속 - ETH Zurich

6. Divide and Cooperate: Role-Decomposed Multi-Agent LLM Training with Cross-Agent Learning Signals

Jaewan Park, Solbee Cho, Jay-Yoon Lee

주저자 소속 - Seoul National University

7. ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity

Andrew Bo Liu, Samira Nedungadi, Bryce Cai, Alex Kleinman, Harmon Bhasin, Seth Donoughe

주저자 소속 - SecureBio, Cambridge, MA, USA

8. MIRAGE: A Polarity-Flipping Encoding Subspace in LLM Agents

Pratibha Revankar, Kargi Chauhan, Jihye Kim, Sadiba Nusrat Nur, Vincent Siu, Chenguang Wang

주저자 소속 - University of California, Santa Cruz

9. WebChallenger: A Reliable and Efficient Generalist Web Agent

Jayoo Hwang, Xiaowen Zhang, Vedant Padwal

주저자 소속 - ML Collective

10. T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains

Genta Indra Winata, Amartya Chakraborty, Yuzhen Lin, Swasthi P Rao, Shikhhar Siingh, ..., Kshitij Tayal, Xiuzhu Lin, Anirban Das, Sambit Sahu, Shi-Xiong Zhang

주저자 소속 - AI Foundations, Capital One
나머지 53편 보기

18. 3SPO: State-Score-Supervised Policy Optimization for LLM Agents

Yu Han, Kailing Li, Yang Jiao, Yulin Dai, Yuqian Fu, Linhai Zhuo, Tianwen Qian

주저자 소속 - School of Computer Science and Technology, East China Normal University

19. TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning

Mingyue Cheng, Shuo Yu, Daoyu Wang, Qingchuan Li, Xiaoyu Tao, Qingyang Mao, Yitong Zhou, Qi Liu

주저자 소속 - State Key Laboratory of Cognitive Intelligence, University of Science and Technology of China

28. Effective Reinforcement Learning for Agentic Search by Recycling Zero-Variance Queries During Training

João Coelho, João Magalhães, Bruno Martins, Chenyan Xiong

주저자 소속 - Language Technologies Institute, Carnegie Mellon University, United States

42. EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Weixian Xu, Shilong Liu, Mengdi Wang

주저자 소속 - Shanghai Jiao Tong University, Princeton University

45. Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints

Yueyang Liu, Joon-Seok Kim, Andreas Züfle

주저자 소속 - Intelligence Advanced Research Projects Activity (IARPA)

47. When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

Sai Kartheek Reddy Kasu, Nils Lukas, Samuele Poppi

주저자 소속 - Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)

48. Moonshine: An Autonomous Mathematical Research Agent Centered on Conjecture Generation

Xiaoyang Chen, Xiang Jiang

주저자 소속 - School of Mathematical Science, Tongji University

55. Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

Yiqing Lyu, Xianbing Zhao, Buzhou Tang, Ronghuan Jiang

주저자 소속 - School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China

62. Dropout-GRPO: Variational Stochasticity for Continuous Latent Reasoning

Wooil Jung

주저자 소속 - University of California, San Diego