Cortiq

· 63 papers

Agentic AI

A ranked brief from the day's arXiv listing. Cortiq weighs topical fit, lead-author context, and public research signals before the issue is published.

1. Less Context, Better Agents: Efficient Context Engineering for Long-Horizon Tool-Using LLM Agents

Abhilasha Lodha, Mahsa Pahlavikhah Varnosfaderani, Abir Chakraborty, Abhinav Mithal

Lead affiliation - Stanford University

2. HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning

Juncheng Diao, Zhicong Lu, Peiguang Li, Yongwei Zhou, Changyuan Tian, Qingbin Li, Rongxiang Weng, Jingang Wang, Xunliang Cai

Lead affiliation - Meituan

3. Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation

Yupu Hao, Zhuoran Jin, Huanxuan Liao, Kang Liu, Jun Zhao

Lead affiliation - The Key Laboratory of Cognition and Decision Intelligence, School of Computer Science and Technology, Shandong University, China

4. Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

Yuchen Ling, Shengcheng Yu, Zhenyu Chen, Chunrong Fang

Lead affiliation - State Key Laboratory for Novel Software Technology, Nanjing University

5. Assessing Automated Prompt Injection Attacks in Agentic Environments

David Hofer, Edoardo Debenedetti, Florian Tramèr

Lead affiliation - ETH Zurich

6. Divide and Cooperate: Role-Decomposed Multi-Agent LLM Training with Cross-Agent Learning Signals

Jaewan Park, Solbee Cho, Jay-Yoon Lee

Lead affiliation - Seoul National University

7. ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity

Andrew Bo Liu, Samira Nedungadi, Bryce Cai, Alex Kleinman, Harmon Bhasin, Seth Donoughe

Lead affiliation - SecureBio, Cambridge, MA, USA

8. MIRAGE: A Polarity-Flipping Encoding Subspace in LLM Agents

Pratibha Revankar, Kargi Chauhan, Jihye Kim, Sadiba Nusrat Nur, Vincent Siu, Chenguang Wang

Lead affiliation - University of California, Santa Cruz

9. WebChallenger: A Reliable and Efficient Generalist Web Agent

Jayoo Hwang, Xiaowen Zhang, Vedant Padwal

Lead affiliation - ML Collective

10. T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains

Genta Indra Winata, Amartya Chakraborty, Yuzhen Lin, Swasthi P Rao, Shikhhar Siingh, ..., Kshitij Tayal, Xiuzhu Lin, Anirban Das, Sambit Sahu, Shi-Xiong Zhang

Lead affiliation - AI Foundations, Capital One
Show 53 more

18. 3SPO: State-Score-Supervised Policy Optimization for LLM Agents

Yu Han, Kailing Li, Yang Jiao, Yulin Dai, Yuqian Fu, Linhai Zhuo, Tianwen Qian

Lead affiliation - School of Computer Science and Technology, East China Normal University

19. TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning

Mingyue Cheng, Shuo Yu, Daoyu Wang, Qingchuan Li, Xiaoyu Tao, Qingyang Mao, Yitong Zhou, Qi Liu

Lead affiliation - State Key Laboratory of Cognitive Intelligence, University of Science and Technology of China

28. Effective Reinforcement Learning for Agentic Search by Recycling Zero-Variance Queries During Training

João Coelho, João Magalhães, Bruno Martins, Chenyan Xiong

Lead affiliation - Language Technologies Institute, Carnegie Mellon University, United States

42. EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Weixian Xu, Shilong Liu, Mengdi Wang

Lead affiliation - Shanghai Jiao Tong University, Princeton University

45. Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints

Yueyang Liu, Joon-Seok Kim, Andreas Züfle

Lead affiliation - Intelligence Advanced Research Projects Activity (IARPA)

47. When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

Sai Kartheek Reddy Kasu, Nils Lukas, Samuele Poppi

Lead affiliation - Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)

48. Moonshine: An Autonomous Mathematical Research Agent Centered on Conjecture Generation

Xiaoyang Chen, Xiang Jiang

Lead affiliation - School of Mathematical Science, Tongji University

55. Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

Yiqing Lyu, Xianbing Zhao, Buzhou Tang, Ronghuan Jiang

Lead affiliation - School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China

62. Dropout-GRPO: Variational Stochasticity for Continuous Latent Reasoning

Wooil Jung

Lead affiliation - University of California, San Diego