共计 153 篇文章
2024
BitNet b1.58
Fuyu
Sora
DLinear-Are Transformers Effective for Time Forecasting
Depth Anything-Unleashing the Power of Large-Scale Unlabeled Data
Mamba---Linear-Time Sequence Modeling with Selective State Spaces
On Embeddings for Numerical Features in Tabular Deep Learning
Self-Supervision is All You Need for Solving Rubik’s Cube
2023
SC-NAFSSR
LLaMA2