Cortiq

· 110 papers

Agentic AI

A ranked brief selected from the arXiv papers released on this date. Cortiq weighs topic relevance, lead-author context, and public research signals together.

1. Do Agents Think Deeper? A Mechanistic Investigation of Layer-Wise Dynamics in Sequential Planning

Zhenyu Cui, Xiangzhong Luo

2. AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios

Kou Shi, Ziao Zhang, Shiting Huang, Avery Nie, Zhen Fang, Qiuchen Wang, Lin Chen, Huaian Chen, Zehui Chen, Feng Zhao

3. Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

Aman Priyanshu, Supriti Vijay, Esha Pahwa

Lead author affiliation - Foundation AI

4. A Unified Framework for the Evaluation of LLM Agentic Capabilities

Pengyu Zhu, Lijun Li, Yaxing Lyu, Qianxin Luo, Jingyi Yang, ..., Tingfeng Hui, Xinyu Yuan, Li Sun, Sen Su, Jing Shao

5. DisasterBench: Benchmarking LLM Planning under Typed Tool Interface Constraints

Zhitong Chen, Kai Yin, Weifeng Zhang, Zhiyuan Wang, Xiangjue Dong, Chengkai Liu, Zhewei Liu, Yiming Xiao, Ali Mostafavi, James Caverlee

6. A Query Engine for the Agents

Kenny Daniel

Lead author affiliation - Hyperparam

7. When Does Memory Help Multi-Trajectory Inference for Tool-Use LLM Agents?

Xinzhe Li, Yaguang Tao

8. Do Agents Know What They Can't Do? Evaluating Feasibility Awareness in Tool-Using Agents

Liang Cheng, Mingsheng Cai, Jiuming Jiang, Luo Mai

9. Adaptive Multimodal Agents-Based Framework for Automatic Workflow Execution

Susanna Cifani, Mario Luca Bernardi, Marta Cimitile

10. ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Guijin Son, Seungyeop Yi, Minju Gwak, Hyunwoo Ko, Wongi Jang, Youngjae Yu
