Cortiq

· 72편

Agentic AI

해당 날짜의 arXiv 발표에서 선별한 랭킹 브리프입니다. Cortiq은 주제 적합도, 주저자 맥락, 공개 연구 신호를 함께 봅니다.

1. RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

Xian Qi Loye, Qinglin Su, Zhexin Zhang, Shiyao Cui, Qi Zhu, Fei Mi, Hongning Wang, Minlie Huang

주저자 소속 - The Conversational AI (CoAI) group, DCST, Tsinghua University

2. Plan First, Judge Later, Run Better: A DMAIC-Inspired Agentic System for Industrial Anomaly Detection

Yongzi Yu, Ao Li, Le Wang, Ziyue Li, Fugee Tsung, Yuxuan Liang, Man Li

주저자 소속 - The Hong Kong University of Science and Technology (Guangzhou)

3. MapAgent: An Industrial-Grade Agentic Framework for City-scale Lane-level Map Generation

Deguo Xia, Zihan Li, Haochen Zhao, Dong Xie, Yuyao Kong, Xiyan Liu, Jizhou Huang, Mengmeng Yang, Diange Yang

주저자 소속 - Tsinghua University

4. Scaling Self-Evolving Agents via Parametric Memory

Tao Ren, Weiyao Luo, Hui Yang, Rongzhi Zhu, Xiang Huang, ..., Bingxue Chou, Jieping Ye, Jiafeng Liang, Yongbin Li, Yijie Peng

주저자 소속 - Stanford University

5. Strabo: Declarative Specification and Implementation of Agentic Interaction Protocols

Samuel H. Christie V, Amit K. Chopra, Munindar P. Singh

주저자 소속 - Stanford University

6. SMAC-Talk: A Natural Language Extension of the StarCraft Multi-Agent Challenge for Large Language Models

Joel Sol, Homayoun Najjaran

주저자 소속 - Faculty of Engineering and Computer Science

7. Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation

Saroj Mishra

주저자 소속 - University of North Dakota

8. Exploring Cross-Scenario Generality of Agentic Memory Systems: Diagnostics and a Strong Baseline

Zhikai Chen, Jialiang Gu, Junyu Yin, Xianxuan Long, Shenglai Zeng, Xiaoze Liu, Kai Guo, Keren Zhou, Jiliang Tang

주저자 소속 - Michigan State University

9. The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development?

Xinyu Lu, Tianshu Wang, Pengbo Wang, zujie wen, Zhiqiang Zhang, ..., Boxi Cao, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun

주저자 소속 - Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences

10. AgentJet: A Flexible Swarm Training Framework for Agentic Reinforcement Learning

Qingxu Fu, Boyin Liu, Shuchang Tao, Zhaoyang Liu, Bolin Ding

주저자 소속 - Tongyi Lab, Alibaba Group
나머지 62편 보기

12. TIBlender: Early-Warning Threat Intelligence from Cross-Platform Social Media Evidence

Hiroki Nakano, Takashi Koide, Daiki Chiba

주저자 소속 - NTT Security Holdings Corporation & NTT, Inc., Tokyo, Japan

24. Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair

Zehua Cheng, Wei Dai, Jiahao Sun, Thomas Lukasiewicz

주저자 소속 - Department of Computer Science, University of Oxford, UK

25. Consensus is Strategically Insufficient: Reasoning-Trace Disagreement as a Knowledge-Representation Signal

Michał Wawer, Jarosław A. Chudziak

주저자 소속 - Laboratory of The New Ethos, Warsaw University of Technology, Warsaw, Poland

29. SMADE-IE: Sparse Multi-Agent Framework with Evidence-Driven Debate for Zero-Shot Information Extraction

Kenfeng Huang, Yi Cai, Xin Wu, Zikun Deng, Li Yuan

주저자 소속 - School of Software Engineering, South China University of Technology, Guangzhou, China

35. Entity Binding Failures in Speech LLM Reasoning: Diagnosis and Chain-of-Thought Intervention

Ming-Hao Hsu, Xiaohai Tian, Jun Zhang, Zhizheng Wu

주저자 소속 - School of Data Science, The Chinese University of Hong Kong, Shenzhen, China

44. Tree-Based Formalization of Multi-Agent Complementarity in Human-AI Interactions

Andrea Ferrario

주저자 소속 - Institute of Biomedical Ethics and History of Medicine, University of Zurich, Zurich, Switzerland

54. Selection-Aware Diagnostics for Chain-of-Thought Answer Hijacking

Jianwei Tai

주저자 소속 - School of Internet, Anhui University

59. CoPark: Learning Reactive Parking via Self-Play

Jiarong Wei, Yanxing Chen, Sinuo Song, Yin Wu, Anna Rehr, Abhinav Valada

주저자 소속 - Department of Computer Science, University of Freiburg, Germany

66. Invariant Gradient Alignment for Robust Reasoning Distillation

Zehua Cheng, Wei Dai, Jiahao Sun

주저자 소속 - University of Oxford, Oxford, UK

67. QO-Bench: Diagnosing Query-Operator-Preserving Retrieval over Typed Event Tuples

Mengao Zhang, Xiang Yang, Chang Liu, Tianhui Tan, Ke-wei Huang

주저자 소속 - Asian Institute of Digital Finance, National University of Singapore

71. Bayesian learning for the stochastic shortest path problem

Chon Wai Ho, Sumeetpal S. Singh, Jiaqi Guo

주저자 소속 - Department of Engineering, University of Cambridge, UK