r/LocalLLaMA
500 articles archived
r/LocalLLaMA community 1d ago
Running Hunyuan3D Image to 3D Object on an iPhone
submitted by /u/arduinoRPi4
22
r/LocalLLaMA community 1d ago
Well.. it's a step up from nonstop bot spam I guess
submitted by /u/ForsookComparison
27
r/LocalLLaMA community 1d ago
Explaining Attention with Program Synthesis
The same day I discovered Tracr, this paper dropped. It is very interesting and could potentially accelerate LLM training significantly. The idea of programmable attention seems promising.
submitted by /u/Thrumpwart
27
r/LocalLLaMA community 1d ago
NEW on Hugging Face: Filter by hardware compatibility
submitted by /u/paf1138
13
r/LocalLLaMA community 2d ago
Introducing LongCat-2.0, a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token. This was the stealth model that was on OpenRouter under the name 'owl-alpha'.
submitted by /u/AnticitizenPrime
18
r/LocalLLaMA community 2d ago
on Dario’s statement
submitted by /u/turtle-toaster
32
r/LocalLLaMA community 2d ago
Amodei: "Open Source Models Will Eat Your Children"
submitted by /u/johnnyApplePRNG
35
r/LocalLLaMA community 2d ago
Samsung, SK hynix, Micron Sued in US Over Memory Price Fixing
submitted by /u/johnnyApplePRNG
15
r/LocalLLaMA community 2d ago
Effect of GLM 5.2 !!
All hail Z.ai
submitted by /u/Independent-Wind4462
13
r/LocalLLaMA community 2d ago
Mellum2 local deployments
Hey local community, I work at JetBrains with the team that trained the Mellum2 models: 12B-2.5A LLMs. These models are trained completely from scratch and target fast inference. Our primary goal was H100/H200 production deployments, but local deployments work well too. We…
37