home
home
Tag cloud
Picture wall
Daily
RSS Feed
Login
Remember me
2732
shaares
32
private links
2732
shaares ·
32
private links
Filters
Links per page
20
50
100
[2101.03961] Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
ml
·
nlp
January 14, 2021 at 5:28:14 PM EST ·
permalink
·
https://arxiv.org/abs/2101.03961
Filters
Links per page
20
50
100
Fold
Fold all
Expand
Expand all
Are you sure you want to delete this link?
Are you sure you want to delete this tag?
The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community