📊 ArXiv Research Report (2026-03-22)

Generated: 2026-03-22 08:32:37
Data source: ArXiv

📌 Configuration

Keyword list (27 keywords, total weight 27.0)

Keyword	Weight	Type
“Large Language Models” OR “LLMs” OR “Foundation Models”	1.0	Primary
“Mixture of Experts” OR “MoE” OR “Sparse Models”	1.0	Primary
“Small Language Models” OR “SLMs” OR “On-device AI”	1.0	Primary
“Scaling Laws” AND “Data Quality”	1.0	Primary
“Pre-training” OR “Continual Pre-training” OR “Domain Adaptation”	1.0	Primary
“Post-training” OR “Supervised Fine-tuning” OR “SFT”	1.0	Primary
“Instruction Tuning” OR “Alignment” OR “Value Alignment”	1.0	Primary
“RLHF” OR “RLAIF” OR “Direct Preference Optimization” OR “DPO”	1.0	Primary
“PEFT” OR “LoRA” OR “Parameter-efficient Fine-tuning”	1.0	Primary
“Retrieval-Augmented Generation” OR “RAG” OR “Retrieval-Generation”	1.0	Primary
“Context Window Extension” OR “Long Context LLMs”	1.0	Primary
“KV Cache Compression” OR “Linear Attention” OR “FlashAttention”	1.0	Primary
“Chain of Thought” OR “CoT Reasoning” OR “Multi-step Reasoning”	1.0	Primary
“System 2 Thinking” OR “Slow Thinking” OR “In-depth Reasoning”	1.0	Primary
“Monte Carlo Tree Search” OR “MCTS” AND “LLM”	1.0	Primary
“Self-Correction” OR “Self-Improvement” OR “Self-Reflection”	1.0	Primary
“LLM Agents” OR “Autonomous Agents” OR “Agentic Workflow”	1.0	Primary
“Tool Use” OR “Function Calling” OR “API Tool Use”	1.0	Primary
“Multi-agent Systems” OR “Agent Coordination”	1.0	Primary
“Quantization” OR “Model Compression” OR “Low-bit Weights”	1.0	Primary
“Speculative Decoding” OR “Inference Acceleration”	1.0	Primary
“Hallucination Mitigation” OR “Factuality” OR “Truthfulness”	1.0	Primary
“Mechanistic Interpretability” OR “Explainable AI”	1.0	Primary
“World Models” AND “General World Models”	1.0	Primary
“Model Merging” OR “Model Soups” OR “Weight Averaging”	1.0	Primary
“In-context Learning” OR “Many-shot Learning”	1.0	Primary
“AI for Science” OR “Bioinformatics” OR “Cheminformatics”	1.0	Primary

Scoring settings

Maximum score per keyword: 15
Passing score formula: 5.0 + 0.8 × total weight
Current passing score: 26.6
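
The passing-score rule can be checked with a few lines of Python. This is a minimal sketch, not the report generator's actual code: the function names `passing_score` and `is_passing` are hypothetical, and treating a score equal to the threshold as passing is an assumption that the report does not confirm.

```python
def passing_score(total_weight: float, base: float = 5.0, factor: float = 0.8) -> float:
    """Passing score as stated in the settings: 5.0 + 0.8 x total weight."""
    return base + factor * total_weight


def is_passing(score: float, threshold: float) -> bool:
    """Assumption: a score equal to the threshold counts as passing."""
    return score >= threshold


threshold = passing_score(27.0)      # 27 keywords, each with weight 1.0
print(f"{threshold:.1f}")            # prints 26.6
print(is_passing(0.0, threshold))    # prints False
```

With a total weight of 27.0 the threshold works out to 5.0 + 21.6 = 26.6, which matches the value shown in the settings.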

📈 Paper statistics

Papers fetched: 4
Papers passing: 0 (0.0%)

📋 All papers

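1. ❌ Recovering Sparse Neural Connectivity from Partial Measurements: A Covariance-Based Approach with Granger-Causality Refinement

Authors: Quilee Simeon
Journal/Source: arxiv
Published: 2026-03-19
arXiv link: http://arxiv.org/abs/2603.18497v1

Score: 0.0 / 26.6 ❌

Score details

Keyword	Weight	Relevance	Score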
Large Language Models OR LLMs OR Foundation Models	0.0	0.0/10	0.0
Mixture of Experts OR MoE OR Sparse Models	0.0	0.0/10	0.0
Small Language Models OR SLMs OR On-device AI	0.0	0.0/10	0.0
Scaling Laws AND Data Quality	0.0	0.0/10	0.0
Pre-training OR Continual Pre-training OR Domain Adaptation	0.0	0.0/10	0.0
Post-training OR Supervised Fine-tuning OR SFT	0.0	0.0/10	0.0
Instruction Tuning OR Alignment OR Value Alignment	0.0	0.0/10	0.0
RLHF OR RLAIF OR Direct Preference Optimization OR DPO	0.0	0.0/10	0.0
PEFT OR LoRA OR Parameter-efficient Fine-tuning	0.0	0.0/10	0.0
Retrieval-Augmented Generation OR RAG OR Retrieval-Generation	0.0	0.0/10	0.0
Context Window Extension OR Long Context LLMs	0.0	0.0/10	0.0
KV Cache Compression OR Linear Attention OR FlashAttention	0.0	0.0/10	0.0
Chain of Thought OR CoT Reasoning OR Multi-step Reasoning	0.0	0.0/10	0.0
System 2 Thinking OR Slow Thinking OR In-depth Reasoning	0.0	0.0/10	0.0
Monte Carlo Tree Search OR MCTS AND LLM	0.0	0.0/10	0.0
Self-Correction OR Self-Improvement OR Self-Reflection	0.0	0.0/10	0.0
LLM Agents OR Autonomous Agents OR Agentic Workflow	0.0	0.0/10	0.0
Tool Use OR Function Calling OR API Tool Use	0.0	0.0/10	0.0
Multi-agent Systems OR Agent Coordination	0.0	0.0/10	0.0
Quantization OR Model Compression OR Low-bit Weights	0.0	0.0/10	0.0
Speculative Decoding OR Inference Acceleration	0.0	0.0/10	0.0
Hallucination Mitigation OR Factuality OR Truthfulness	0.0	0.0/10	0.0
Mechanistic Interpretability OR Explainable AI	0.0	0.0/10	0.0
World Models AND General World Models	0.0	0.0/10	0.0
Model Merging OR Model Soups OR Weight Averaging	0.0	0.0/10	0.0
In-context Learning OR Many-shot Learning	0.0	0.0/10	0.0
AI for Science OR Bioinformatics OR Cheminformatics	0.0	0.0/10	0.0

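Scoring rationale: The paper studies neural connectivity inference in neuroscience, using a covariance-based method with Granger-causality refinement, and belongs to classical neural networks and computational neuroscience. The keywords all concern large models, the technical principles of deep learning, or applications of AI in science, but the paper touches on none of these topics: it makes no mention of large models, language models, training methods, inference techniques, alignment, compression, hallucination mitigation, interpretability, AI for Science, or similar concepts. The relevance of every keyword is therefore 0.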

!!! tip deepseek-chat TL;DR

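The paper proposes a method based on covariance and Granger-causality refinement for inferring the connectivity of a recurrent neural network from sparse, partial measurements, and finds that a linear approximation, acting as implicit regularization, outperforms an oracle estimator with known nonlinearity.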

Abstract

Inferring the connectivity of neural circuits from incomplete observations is a fundamental challenge in neuroscience. We present a covariance-based method for estimating the weight matrix of a recurrent neural network from sparse, partial measurements across multiple recording sessions. By accumulating pairwise covariance estimates across sessions where different subsets of neurons are observed, we reconstruct the full connectivity matrix without requiring simultaneous recording of all neurons. A Granger-causality refinement step enforces biological constraints via projected gradient descent. Through systematic experiments on synthetic networks modeling small brain circuits, we characterize a fundamental control-estimation tradeoff: stimulation aids identifiability but disrupts intrinsic dynamics, with the optimal level depending on measurement density. We discover that the "incorrect" linear approximation acts as implicit regularization – outperforming the oracle estimator with known nonlinearity at all operating regimes – and provide an exact characterization via the Stein–Price identity.

Keywords: neural connectivity, covariance-based method, Granger-causality, recurrent neural network, sparse measurements, implicit regularization, Stein-Price identity, control-estimation tradeoff
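
The accumulation step described in the abstract (pooling pairwise covariance estimates from sessions that each observe a different subset of neurons) can be illustrated as follows. This is a rough sketch under stated assumptions, not the paper's implementation: the function name `accumulate_covariance`, the unweighted averaging across sessions, and the toy data are all hypothetical, and the weight-matrix estimation and Granger-causality refinement steps are not shown.

```python
def accumulate_covariance(sessions, n_neurons):
    """Average per-session sample covariances over the sessions that observed each pair.

    sessions: list of (observed, samples); `observed` lists neuron indices and
    each row of `samples` holds one time step for those neurons, in that order.
    Returns (cov, counts); cov[i][j] is None if i and j were never co-observed.
    """
    sums = [[0.0] * n_neurons for _ in range(n_neurons)]
    counts = [[0] * n_neurons for _ in range(n_neurons)]
    for observed, samples in sessions:
        t = len(samples)
        means = [sum(row[a] for row in samples) / t for a in range(len(observed))]
        for a, i in enumerate(observed):
            for b, j in enumerate(observed):
                # Unbiased sample covariance of neurons i and j within this session.
                c = sum(
                    (row[a] - means[a]) * (row[b] - means[b]) for row in samples
                ) / (t - 1)
                sums[i][j] += c
                counts[i][j] += 1
    cov = [
        [sums[i][j] / counts[i][j] if counts[i][j] else None for j in range(n_neurons)]
        for i in range(n_neurons)
    ]
    return cov, counts


# Toy data (hypothetical): session 1 records neurons 0 and 1, session 2 records 1 and 2.
sessions = [
    ([0, 1], [[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]]),
    ([1, 2], [[0.0, 1.0], [2.0, 0.0], [4.0, -1.0]]),
]
cov, counts = accumulate_covariance(sessions, n_neurons=3)
print(cov[0][1])   # covariance of neurons 0 and 1, seen only in session 1
print(cov[0][2])   # None: neurons 0 and 2 were never recorded together
```

Pairs that never share a session stay unfilled in this sketch; how the paper treats them is not stated in the abstract.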

2. ❌ CAPSUL: A Comprehensive Human Protein Benchmark for Subcellular Localization

Authors: Yicheng Hu, Xinyu Lin, Shulin Li, Wenjie Wang, Fengbin Zhu, Fuli Feng
Journal/Source: arxiv
Published: 2026-03-19
arXiv link: http://arxiv.org/abs/2603.18571v1

Score: 0.0 / 26.6 ❌

Score details

Keyword	Weight	Relevance	Score
Large Language Models OR LLMs OR Foundation Models	0.0	0.0/10	0.0
Mixture of Experts OR MoE OR Sparse Models	0.0	0.0/10	0.0
Small Language Models OR SLMs OR On-device AI	0.0	0.0/10	0.0
Scaling Laws AND Data Quality	0.0	0.0/10	0.0
Pre-training OR Continual Pre-training OR Domain Adaptation	0.0	0.0/10	0.0
Post-training OR Supervised Fine-tuning OR SFT	0.0	0.0/10	0.0
Instruction Tuning OR Alignment OR Value Alignment	0.0	0.0/10	0.0
RLHF OR RLAIF OR Direct Preference Optimization OR DPO	0.0	0.0/10	0.0
PEFT OR LoRA OR Parameter-efficient Fine-tuning	0.0	0.0/10	0.0
Retrieval-Augmented Generation OR RAG OR Retrieval-Generation	0.0	0.0/10	0.0
Context Window Extension OR Long Context LLMs	0.0	0.0/10	0.0
KV Cache Compression OR Linear Attention OR FlashAttention	0.0	0.0/10	0.0
Chain of Thought OR CoT Reasoning OR Multi-step Reasoning	0.0	0.0/10	0.0
System 2 Thinking OR Slow Thinking OR In-depth Reasoning	0.0	0.0/10	0.0
Monte Carlo Tree Search OR MCTS AND LLM	0.0	0.0/10	0.0
Self-Correction OR Self-Improvement OR Self-Reflection	0.0	0.0/10	0.0
LLM Agents OR Autonomous Agents OR Agentic Workflow	0.0	0.0/10	0.0
Tool Use OR Function Calling OR API Tool Use	0.0	0.0/10	0.0
Multi-agent Systems OR Agent Coordination	0.0	0.0/10	0.0
Quantization OR Model Compression OR Low-bit Weights	0.0	0.0/10	0.0
Speculative Decoding OR Inference Acceleration	0.0	0.0/10	0.0
Hallucination Mitigation OR Factuality OR Truthfulness	0.0	0.0/10	0.0
Mechanistic Interpretability OR Explainable AI	0.0	5.0/10	0.0
World Models AND General World Models	0.0	0.0/10	0.0
Model Merging OR Model Soups OR Weight Averaging	0.0	0.0/10	0.0
In-context Learning OR Many-shot Learning	0.0	0.0/10	0.0
AI for Science OR Bioinformatics OR Cheminformatics	0.0	10.0/10	0.0

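Scoring rationale: The paper focuses on bioinformatics: it proposes CAPSUL, a benchmark dataset for protein subcellular localization, and evaluates sequence-based and structure-based models. Its core contributions are dataset construction and a bioinformatics application, which are unrelated to the vast majority of the large-model keywords (LLM, MoE, SFT, RLHF, RAG, quantization, and so on). The only fully relevant keyword is "AI for Science OR Bioinformatics OR Cheminformatics", which scores 10 because the paper is an application of AI to bioinformatics. "Mechanistic Interpretability OR Explainable AI" scores 5 because the paper mentions discovering an α-helix localization pattern through attention mechanisms, which demonstrates the interpretability of structure-based methods, although this is not the paper's core contribution. No other keyword is covered.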

!!! tip deepseek-chat TL;DR

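To address the lack of comprehensive 3D structural data for protein subcellular localization, this study proposes CAPSUL, a human protein benchmark dataset that integrates multiple 3D structural representations with fine-grained subcellular localization annotations. Evaluating sequence-based and structure-based models on it shows the importance of structural features, and a case study uncovers a decisive α-helix localization pattern, paving the way for data-driven discovery in cell biology.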

Abstract

Subcellular localization is a crucial biological task for drug target identification and function annotation. Although it has been biologically realized that subcellular localization is closely associated with protein structure, no existing dataset offers comprehensive 3D structural information with detailed subcellular localization annotations, thus severely hindering the application of promising structure-based models on this task. To address this gap, we introduce a new benchmark called $\mathbf{CAPSUL}$, a $\mathbf{C}$omprehensive hum$\mathbf{A}$n $\mathbf{P}$rotein benchmark for $\mathbf{SU}$bcellular $\mathbf{L}$ocalization. It features a dataset that integrates diverse 3D structural representations with fine-grained subcellular localization annotations carefully curated by domain experts. We evaluate this benchmark using a variety of state-of-the-art sequence-based and structure-based models, showcasing the importance of involving structural features in this task. Furthermore, we explore reweighting and single-label classification strategies to facilitate future investigation on structure-based methods for this task. Lastly, we showcase the powerful interpretability of structure-based methods through a case study on the Golgi apparatus, where we discover a decisive localization pattern $α$-helix from attention mechanisms, demonstrating the potential for bridging the gap with intuitive biological interpretability and paving the way for data-driven discoveries in cell biology.

Keywords: subcellular localization, protein benchmark, 3D structural information, structure-based models, attention mechanisms, biological interpretability, data-driven discovery, cell biology

3. ❌ RAFT-UP: Robust Alignment for Spatial Transcriptomics with Explicit Control of Spatial Distortion

Authors: Yaqi Wu, Jingfeng Wang, Xin Maizie Zhou, Yanxiang Zhao, Zixuan Cang
Journal/Source: arxiv
Published: 2026-03-18
arXiv link: http://arxiv.org/abs/2603.18249v1

Score: 0.0 / 26.6 ❌

Score details

Keyword	Weight	Relevance	Score
Large Language Models OR LLMs OR Foundation Models	0.0	0.0/10	0.0
Mixture of Experts OR MoE OR Sparse Models	0.0	0.0/10	0.0
Small Language Models OR SLMs OR On-device AI	0.0	0.0/10	0.0
Scaling Laws AND Data Quality	0.0	0.0/10	0.0
Pre-training OR Continual Pre-training OR Domain Adaptation	0.0	0.0/10	0.0
Post-training OR Supervised Fine-tuning OR SFT	0.0	0.0/10	0.0
Instruction Tuning OR Alignment OR Value Alignment	0.0	0.0/10	0.0
RLHF OR RLAIF OR Direct Preference Optimization OR DPO	0.0	0.0/10	0.0
PEFT OR LoRA OR Parameter-efficient Fine-tuning	0.0	0.0/10	0.0
Retrieval-Augmented Generation OR RAG OR Retrieval-Generation	0.0	0.0/10	0.0
Context Window Extension OR Long Context LLMs	0.0	0.0/10	0.0
KV Cache Compression OR Linear Attention OR FlashAttention	0.0	0.0/10	0.0
Chain of Thought OR CoT Reasoning OR Multi-step Reasoning	0.0	0.0/10	0.0
System 2 Thinking OR Slow Thinking OR In-depth Reasoning	0.0	0.0/10	0.0
Monte Carlo Tree Search OR MCTS AND LLM	0.0	0.0/10	0.0
Self-Correction OR Self-Improvement OR Self-Reflection	0.0	0.0/10	0.0
LLM Agents OR Autonomous Agents OR Agentic Workflow	0.0	0.0/10	0.0
Tool Use OR Function Calling OR API Tool Use	0.0	0.0/10	0.0
Multi-agent Systems OR Agent Coordination	0.0	0.0/10	0.0
Quantization OR Model Compression OR Low-bit Weights	0.0	0.0/10	0.0
Speculative Decoding OR Inference Acceleration	0.0	0.0/10	0.0
Hallucination Mitigation OR Factuality OR Truthfulness	0.0	0.0/10	0.0
Mechanistic Interpretability OR Explainable AI	0.0	0.0/10	0.0
World Models AND General World Models	0.0	0.0/10	0.0
Model Merging OR Model Soups OR Weight Averaging	0.0	0.0/10	0.0
In-context Learning OR Many-shot Learning	0.0	0.0/10	0.0
AI for Science OR Bioinformatics OR Cheminformatics	0.0	5.0/10	0.0

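Scoring rationale: The paper focuses on a bioinformatics method for aligning spatial transcriptomics (ST) data. It proposes RAFT-UP, a tool built on a fused supervised Gromov-Wasserstein optimal transport framework, to address control of spatial distortion and biologically plausible matching when aligning ST slices. The content is unrelated to the vast majority of the keywords (large-model techniques, training methods, inference optimization, agents, and so on). It relates only to the last keyword, "AI for Science OR Bioinformatics OR Cheminformatics", because the work belongs to bioinformatics and is an application of AI in science (specifically biomedicine). However, the paper does not involve large models or deep learning: its core method is a mathematical framework based on optimal transport rather than neural networks or large language models. The last keyword therefore receives 5 (partially relevant) and all others receive 0 (unrelated).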

!!! tip deepseek-chat TL;DR

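To address insufficient control of spatial distortion and biologically implausible matching in spatial transcriptomics alignment, the paper proposes RAFT-UP, a tool built on a fused supervised Gromov-Wasserstein optimal transport framework. It provides explicit control over spatial distance preservation and performs well in alignment accuracy and downstream applications.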

Abstract

Spatial transcriptomics (ST) profiles gene expression across a tissue section while preserving the spatial coordinates. Because current ST technologies typically profile two-dimensional tissue slices, integrating and aligning slices from different regions of the same three-dimensional tissue or from samples under different conditions enables analyses that reveal 3D organization and condition-associated spatial patterns. Two major challenges remain. First, interpretable and flexible control over spatial distortion is needed because rigid transformations can be overly restrictive, whereas highly deformable mappings may arbitrarily distort spatial proximity. Second, biologically plausible matching is also needed, especially when the slices overlap partially. Here, we introduce RAFT-UP, a tool for robust ST alignment that provides explicit control over spatial distance preservation through a fused supervised Gromov-Wasserstein (FsGW) optimal transport framework. FsGW combines expression and spatial information, incorporates spot-wise constraints to discourage biologically implausible matches, and enforces a pairwise distance-consistency constraint that prevents mapping two pairs of spots when their spatial distances differ beyond a specified tolerance. We demonstrate that RAFT-UP accurately aligns slices from different regions of the same tissue and slices from different samples. Benchmarking shows that RAFT-UP improves spatial distance preservation while achieving spot label matching accuracy comparable to state-of-the-art methods. Finally, we demonstrate RAFT-UP on two spatially constrained downstream applications, including spatiotemporal mapping of developing mouse midbrain and comparative cross-slice analysis of cell-cell communication. RAFT-UP is available as open-source software.

Keywords: Spatial transcriptomics, Alignment, Gromov-Wasserstein optimal transport, Spatial distortion control, Bioinformatics, Tissue slice integration, Cell-cell communication analysis, Open-source software
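
The pairwise distance-consistency idea from the abstract (do not map two pairs of spots onto each other when their spatial distances differ by more than a tolerance) can be sketched as a simple check. This is only an illustration of the constraint as worded in the abstract, not RAFT-UP's API or the FsGW solver; the function name `pair_is_consistent`, the use of Euclidean distance, and the coordinates are assumptions.

```python
import math


def pair_is_consistent(x_i, x_j, y_k, y_l, tol):
    """Return True if matching i->k and j->l distorts the pair's distance by at most tol.

    x_i, x_j are spot coordinates in slice X; y_k, y_l are coordinates in slice Y.
    Assumption: distances are Euclidean and use the same units in both slices.
    """
    return abs(math.dist(x_i, x_j) - math.dist(y_k, y_l)) <= tol


# Both pairs are 5 units apart, so this matching is allowed.
print(pair_is_consistent((0, 0), (3, 4), (1, 1), (4, 5), tol=0.5))   # True
# 5 units versus 1 unit: the distortion exceeds the tolerance.
print(pair_is_consistent((0, 0), (3, 4), (0, 0), (0, 1), tol=0.5))   # False
```

A small tolerance behaves like a near-rigid alignment, while a large one permits more deformation, which is the trade-off the abstract describes between rigid and highly deformable mappings.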

4. ❌ SCALE: Scalable Conditional Atlas-Level Endpoint transport for virtual cell perturbation prediction

Authors: Shuizhou Chen, Lang Yu, Kedu Jin, Songming Zhang, Hao Wu, Wenxuan Huang, Sheng Xu, Quan Qian, Qin Chen, Lei Bai, Siqi Sun, Zhangyang Gao
Journal/Source: arxiv
Published: 2026-03-18
arXiv link: http://arxiv.org/abs/2603.17380v2

Score: 0.0 / 26.6 ❌

Score details

Keyword	Weight	Relevance	Score
Large Language Models OR LLMs OR Foundation Models	0.0	8.0/10	0.0
Mixture of Experts OR MoE OR Sparse Models	0.0	0.0/10	0.0
Small Language Models OR SLMs OR On-device AI	0.0	0.0/10	0.0
Scaling Laws AND Data Quality	0.0	5.0/10	0.0
Pre-training OR Continual Pre-training OR Domain Adaptation	0.0	8.0/10	0.0
Post-training OR Supervised Fine-tuning OR SFT	0.0	0.0/10	0.0
Instruction Tuning OR Alignment OR Value Alignment	0.0	0.0/10	0.0
RLHF OR RLAIF OR Direct Preference Optimization OR DPO	0.0	0.0/10	0.0
PEFT OR LoRA OR Parameter-efficient Fine-tuning	0.0	0.0/10	0.0
Retrieval-Augmented Generation OR RAG OR Retrieval-Generation	0.0	0.0/10	0.0
Context Window Extension OR Long Context LLMs	0.0	0.0/10	0.0
KV Cache Compression OR Linear Attention OR FlashAttention	0.0	0.0/10	0.0
Chain of Thought OR CoT Reasoning OR Multi-step Reasoning	0.0	0.0/10	0.0
System 2 Thinking OR Slow Thinking OR In-depth Reasoning	0.0	0.0/10	0.0
Monte Carlo Tree Search OR MCTS AND LLM	0.0	0.0/10	0.0
Self-Correction OR Self-Improvement OR Self-Reflection	0.0	0.0/10	0.0
LLM Agents OR Autonomous Agents OR Agentic Workflow	0.0	0.0/10	0.0
Tool Use OR Function Calling OR API Tool Use	0.0	0.0/10	0.0
Multi-agent Systems OR Agent Coordination	0.0	0.0/10	0.0
Quantization OR Model Compression OR Low-bit Weights	0.0	0.0/10	0.0
Speculative Decoding OR Inference Acceleration	0.0	5.0/10	0.0
Hallucination Mitigation OR Factuality OR Truthfulness	0.0	0.0/10	0.0
Mechanistic Interpretability OR Explainable AI	0.0	0.0/10	0.0
World Models AND General World Models	0.0	0.0/10	0.0
Model Merging OR Model Soups OR Weight Averaging	0.0	0.0/10	0.0
In-context Learning OR Many-shot Learning	0.0	0.0/10	0.0
AI for Science OR Bioinformatics OR Cheminformatics	0.0	10.0/10	0.0

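Scoring rationale: The paper proposes SCALE, a large-scale foundation model for virtual cell perturbation prediction. It belongs to AI for Science (bioinformatics), so that keyword is highly relevant (10). The paper explicitly mentions LLaMA-based cellular encoding, an application of large-model technology (8). The model involves pre-training and infrastructure optimization, which relates to Pre-training and Scaling Laws (8 and 5 respectively). The BioNeMo framework improves training and inference efficiency, which relates to Inference Acceleration (5). Other keywords such as MoE, SFT, and RAG are not mentioned in the abstract and score 0.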

!!! tip deepseek-chat TL;DR

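The paper proposes SCALE, a large-scale foundation model for virtual cell perturbation prediction. By optimizing the training and inference framework, modeling perturbation prediction as conditional transport, and adopting a biologically meaningful evaluation protocol, it significantly improves prediction speed and performance on biological metrics.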

Abstract

Virtual cell models aim to enable in silico experimentation by predicting how cells respond to genetic, chemical, or cytokine perturbations from single-cell measurements. In practice, however, large-scale perturbation prediction remains constrained by three coupled bottlenecks: inefficient training and inference pipelines, unstable modeling in high-dimensional sparse expression space, and evaluation protocols that overemphasize reconstruction-like accuracy while underestimating biological fidelity. In this work we present a specialized large-scale foundation model SCALE for virtual cell perturbation prediction that addresses the above limitations jointly. First, we build a BioNeMo-based training and inference framework that substantially improves data throughput, distributed scalability, and deployment efficiency, yielding 12.51× speedup on pretrain and 1.29× on inference over the prior SOTA pipeline under matched system settings. Second, we formulate perturbation prediction as conditional transport and implement it with a set-aware flow architecture that couples LLaMA-based cellular encoding with endpoint-oriented supervision. This design yields more stable training and stronger recovery of perturbation effects. Third, we evaluate the model on Tahoe-100M using a rigorous cell-level protocol centered on biologically meaningful metrics rather than reconstruction alone. On this benchmark, our model improves PDCorr by 12.02% and DE Overlap by 10.66% over STATE. Together, these results suggest that advancing virtual cells requires not only better generative objectives, but also the co-design of scalable infrastructure, stable transport modeling, and biologically faithful evaluation.

Keywords: virtual cell perturbation prediction, large-scale foundation model, BioNeMo-based framework, conditional transport, LLaMA-based encoding, scalable infrastructure, biological fidelity, Tahoe-100M benchmark

Token usage

Total: 11,364 tokens (input 7,256 / output 4,108)
