Cortiq

· 103 papers

Agentic AI

A ranked brief selected from the arXiv releases for this date. Cortiq weighs topic relevance, lead-author context, and public research signals together.

1. Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

Chenghao Zhang, Guanting Dong, Yufan Liu, Tong Zhao, Zhicheng Dou

Lead author affiliation - Gaoling School of Artificial Intelligence, Renmin University of China

2. The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

Tyler Akidau, Tyler Rockwood, Johannes Brüderl, Marc Millstone

Lead author affiliation - Redpanda

3. VitalAgent: A Tool-Augmented Agent for Reactive and Proactive Physiological Monitoring over Wearable Health Data

Di Zhu, Yu Yvonne Wu, Hong Jia, Aaqib Saeed, Vassilis Kostakos, Ting Dang

Lead author affiliation - The University of Melbourne, Australia

4. Training Deliberative Monitors for Black-Box Scheming Detection

Aditya Sinha, Akshat Naik, Victor Gillioz, Simon Storf, Kilian Merkelbach, Rich Barton-Cooper, Axel Højmark, Marius Hobbhahn

Lead author affiliation - Independent

5. How Consistent Are LLM Agents? Measuring Behavioral Reproducibility in Multi-Step Tool-Calling Pipelines

Abel Yagubyan

Lead author affiliation - Independent Researcher

6. MINDGAMES: A Live Arena for Evaluating Social and Strategic Reasoning in Multi-Agent LLMs

Kevin Wang, Anna Thöni, Benjamin Kempinski, Bobby Cheng, Jianzhu Yao, ..., Pramod Viswanath, Maria Polukarov, Cheston Tan, Tal Kachman, Atlas Wang

7. RoboWits: Unexpected Challenges for Robotic Creative Problem Solving

Chunru Lin, Hongxin Zhang, Fenghao Yu, Zhehuan Chen, Thomas L. Griffiths, Yejin Choi, David Held, Chuang Gan

Lead author affiliation - University of Massachusetts Amherst

8. Hallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching

Diego Gosmar, Deborah A. Dahl

Lead author affiliation - Tesisquare

9. GTA: Generating Long-Horizon Tasks for Web Agents at Scale

Tenghao Huang, Kung-Hsiang Huang, Prafulla Kumar Choubey, Yilun Zhou, Muhao Chen, Jonathan May, Chien-Sheng Wu

Lead author affiliation - University of Southern California

10. PTCG-Bench: Can LLM Agents Master Pokémon Trading Card Game?

Dongdong Hua, Yifei Sun, Renhong Huang, Feng Gao, Chunping Wang, Yang Yang

Lead author affiliation - Zhejiang University