Paper Plaza - AcademicHub

01.

arXiv (CS.LG) 2026-06-16 DOI: arXiv:2606.16273

Generative Modeling on Metric Graphs via Neural Optimal Transport

Authors:

Alessandro Micheli, Yueqi Cao, Anthea Monod, Samir Bhatt

arXiv:2606.16273v1 (announce type: cross)

Abstract: We introduce, to our knowledge, the first deep generative modeling framework for probability distributions continuously supported on compact metric graphs. Given source and target measures on a metric graph, our method embeds the graph into a smooth ambient space, solves an entropic Kantorovich problem via a neural semidual parameterization, and projects generated samples back onto the original graph. We study two embedded geometries: an extrinsic Euclidean realization and the intrinsic tropical Abel–Jacobi embedding into the Jacobian torus. In both cases, the resulting generator is graph-supported by construction. We prove that, in the joint limit of increasing neural expressivity, the learned generator converges weakly to a valid transport coupling between the original graph measures. Empirically, across a range of geometrically distinct graphs, our method matches or improves upon heuristic transport baselines based on discrete graph OT, while scaling more favorably. Finally, we demonstrate scalability on real-world urban mobility data by training our model on one million Uber pickup locations in Manhattan, New York City.
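The abstract does not say how generated samples are projected back onto the graph. As a rough illustration of that final step in the extrinsic Euclidean setting, the sketch below assumes the graph is realized with straight-line edges and uses a plain nearest-point projection. The function name `project_to_graph` and its arguments are hypothetical and are not taken from the paper.

```python
import numpy as np

def project_to_graph(points, vertices, edges):
    """Nearest-point projection onto a graph with straight-line edges.

    Illustrative assumption only: the paper's actual projection may differ.
    points:   (n, d) array of ambient-space samples
    vertices: (m, d) array of vertex coordinates
    edges:    sequence of (i, j) vertex-index pairs, each of nonzero length
    Returns the projected points (n, d), the index of the chosen edge (n,),
    and the position t in [0, 1] along that edge (n,).
    """
    points = np.asarray(points, dtype=float)
    vertices = np.asarray(vertices, dtype=float)
    edges = np.asarray(edges, dtype=int)

    a = vertices[edges[:, 0]]              # edge start points, shape (k, d)
    ab = vertices[edges[:, 1]] - a         # edge direction vectors, shape (k, d)
    len2 = np.einsum("kd,kd->k", ab, ab)   # squared edge lengths, shape (k,)

    # Parameter of the orthogonal projection onto each edge's supporting line,
    # clipped to [0, 1] so the result stays on the segment.
    ap = points[:, None, :] - a[None, :, :]                    # (n, k, d)
    t = np.clip(np.einsum("nkd,kd->nk", ap, ab) / len2, 0.0, 1.0)

    closest = a[None, :, :] + t[..., None] * ab[None, :, :]    # (n, k, d)
    dist2 = np.sum((points[:, None, :] - closest) ** 2, axis=-1)

    best = np.argmin(dist2, axis=1)        # nearest edge for each point
    rows = np.arange(points.shape[0])
    return closest[rows, best], best, t[rows, best]
```

Returning the pair (edge index, t) alongside the coordinates gives each sample an intrinsic location on the graph, so downstream code does not need to re-derive it from ambient coordinates.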


02.

arXiv (CS.AI) 2026-06-12 DOI: arXiv:2606.12736

Benchmarking AI Agents for Addressing Scientific Challenges Across Scales

Authors:

Tianyu Liu, Allen Xin Wang, Antonia Panescu, Lisa Xinyi Chen, Wenxin Long, Xinyu Wei, Yueqian Jing, Ziyao Zeng, Jihang Chen, Sihan Jiang, Ziqing Wang, Siyi Gu, …

arXiv:2606.12736v1 (announce type: new)

Abstract: AI agents are increasingly being developed to accelerate scientific discovery, yet their practical capabilities in real research settings remain poorly understood. Existing benchmarks for AI agents rarely capture the complexity, heterogeneity, and extended reasoning required by scientific work, whereas benchmarks for scientific tasks often reduce research to static, direct problems and provide limited support for interactive evaluation. Here, we introduce SciAgentArena, a systematic benchmark for evaluating AI agents in real-world scientific research scenarios drawn from emerging needs across multiple domains. SciAgentArena comprises approximately 200 tasks with stepwise verification and an interactive, agent-agnostic environment for assessing diverse AI agents. Using this benchmark, we find that current agents can contribute effectively to well-specified data-analysis workflows, particularly when the task structure and evaluation criteria are clear. However, their performance remains uneven across scientific contexts: agents struggle to generate genuinely novel insights, sustain self-directed exploration, and formulate robust solutions for open-ended research questions. We further characterize common failure modes across agents and identify opportunities for improving their reliability, autonomy, and scientific reasoning. Together, SciAgentArena provides a practical framework for measuring progress in AI agents for science and for guiding the design of future agents capable of addressing complex scientific challenges. The full code, tasks, and datasets are available at https://sciagentarena.github.io/.

