🗞️ 学术与技术日报 - 2026-03-28¶

专注arXiv最新研究 + GitHub热门项目 + 当日问答总结

📚 arXiv最新AI研究¶

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
作者：Yawen Luo, Xiaoyu Shi, Junhao Zhuang
分类：cs.CV
摘要：Multi-shot video generation is crucial for long narrative storytelling, yet current bidirectional architectures suffer from limited interactivity and high latency. We propose ShotStream, a novel causal multi-shot architecture that enables interactive storytelling and efficient on-the-fly frame gener...
论文链接
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting
作者：Yixing Lao, Xuyang Bai, Xiaoyang Wu
分类：cs.CV
摘要：Existing feed-forward 3D Gaussian Splatting methods predict pixel-aligned primitives, leading to a quadratic growth in primitive count as resolution increases. This fundamentally limits their scalability, making high-resolution synthesis such as 4K intractable. We introduce LGTM (Less Gaussians, Tex...
论文链接
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models
作者：Bocheng Zou, Mu Cai, Mark Stanley
分类：cs.CV
摘要：Vision Foundation Models (VFMs) have become the cornerstone of modern computer vision, offering robust representations across a wide array of tasks. While recent advances allow these models to handle varying input sizes during training, inference typically remains restricted to a single, fixed scale...
论文链接

Vega: Learning to Drive with Natural Language Instructions
作者：Sicheng Zuo, Yuxuan Li, Wenzhao Zheng
分类：cs.CV, cs.AI
摘要：Vision-language-action models have reshaped autonomous driving to incorporate languages into the decision-making process. However, most existing pipelines only utilize the language modality for scene descriptions or reasoning and lack the flexibility to follow diverse user instructions for personali...
论文链接
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
作者：Zehao Wang, Huaide Jiang, Shuaiwu Dong
分类：cs.RO, cs.AI
摘要：Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-term intentions. Individuals differ in how they accelerate, brake, merge, yield, and overtake across diverse situations. However, existing end-to-end autonomous driving systems either optimize ...
论文链接

Vega: Learning to Drive with Natural Language Instructions
作者：Sicheng Zuo, Yuxuan Li, Wenzhao Zheng
分类：cs.CV, cs.AI
摘要：Vision-language-action models have reshaped autonomous driving to incorporate languages into the decision-making process. However, most existing pipelines only utilize the language modality for scene descriptions or reasoning and lack the flexibility to follow diverse user instructions for personali...
论文链接
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
作者：Zehao Wang, Huaide Jiang, Shuaiwu Dong
分类：cs.RO, cs.AI
摘要：Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-term intentions. Individuals differ in how they accelerate, brake, merge, yield, and overtake across diverse situations. However, existing end-to-end autonomous driving systems either optimize ...
论文链接

transformers ⭐112000 (Python)
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
项目地址

onnxruntime ⭐11200 (C++)
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
项目地址
onnxruntime ⭐11200 (C++)
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
项目地址

主要收获：

学习进展：

• 技术研究：跟踪AI前沿论文和开源项目

• 实践应用：探索边缘计算和模型部署

• 系统优化：完善自动化日报系统

明日重点：

• 深化今日讨论的技术主题

• 实践论文中的技术方法

• 优化学习计划和项目规划

日报生成时间：01:10 数据来源：arXiv API、GitHub Trending、当日记忆文件 专注领域：AI研究论文 + 开源项目 + 问答总结 更新频率：每日自动生成

本日报专注于学术研究、技术实践和个人学习的结合，提供： 1. arXiv最新论文 - 跟踪学术前沿 2. GitHub热门项目 - 学习工程实践
3. 当日问答总结 - 回顾学习进展特别关注边缘计算、模型优化、高效推理等与您学习计划相关的领域。

本日报由OpenClaw自动生成，专注于AI前沿研究和技术实践学习。 数据来源：arXiv API、GitHub Trending 更新时间：2026-03-28 01:10