2025 年 10 月 – Samuel 拾光札记

【鉴赏】Deepseek V3.2 Exp

2025-10-12 11:20

|

289

|

0

|

LLM Reports

811 字

|

4 分钟

标题: DeepSeek-V3.2-Exp: Boosting Long-Context Efficiency with DeepSeek Sparse Attention[1] Paper GitHub 使用 Deepseek Sparse Attention 在没有明显降低精度的情况下大幅降低推理成本。 1. 模型架构和 Deepseek V…

Attention LLM

【鉴赏】On-Policy Distillation

2025-10-06 9:36

|

311

|

0

|

ICLR

681 字

|

4 分钟

标题: On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes[1] FROM ICLR 2024 Google DeepMind arXiv 通用的 KD(Knowledge Distillation) 方法存在教师模型输出和学生模型输出分布…

Distillation LLM

【鉴赏】ACEBench: 评价大模型工具调用的 Benchmark

2025-10-03 14:14

|

490

|

0

|

arXiv

1446 字

|

6 分钟

标题: ACEBench: Who Wins the Match Point in Tool Usage?[1] FROM arXiv 2025 写在前面：这是一篇关于 ACEBench 相对于其他 Benchmark 的优势的文章，提及了 ACEBench 的数据构建方法和数据结构。笔者主要想借助这篇文章来介绍数据构建方式。虽然本文仅限于 AC…

Benchmark LLM

归档

分类

月度归档： 2025 年 10 月

归档

分类