Hacker News
Beyond Standard LLMs (sebastianraschka.com)
1 point by ibobev 2 days ago

A Researcher's Field Guide to Non-Standard LLM Architectures (sebastianraschka.com)
2 points by ModelForge 2 days ago

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch) (sebastianraschka.com)
1 point by ibobev 22 days ago

Popular Attention Alternatives: GQA, MLA, SWA (sebastianraschka.com)
4 points by ModelForge 22 days ago

Multi-Head Latent Attention (sebastianraschka.com)
4 points by ModelForge 24 days ago

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch) (sebastianraschka.com)
2 points by ibobev 27 days ago

LLM Evaluation from Scratch: Multiple Choice, Verifiers, Leaderboards, LLM Judge (sebastianraschka.com)
4 points by ModelForge 32 days ago

Understanding and Implementing Qwen3 from Scratch (sebastianraschka.com)
1 point by ibobev 52 days ago

GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2 (sebastianraschka.com)
490 points by ModelForge 88 days ago | 97 comments

From GPT-2 to GPT-OSS: Analyzing the Architectural Advances (sebastianraschka.com)
3 points by mdp2021 89 days ago

PyTorch in One Hour: From Tensors to Training Neural Networks on Multiple GPUs (sebastianraschka.com)
1 point by Anon84 3 months ago

PyTorch in One Hour: From Tensors to Training Neural Networks on Multiple GPUs (sebastianraschka.com)
4 points by mariuz 3 months ago

LLM architecture comparison (sebastianraschka.com)
418 points by mdp2021 3 months ago | 24 comments

The Big LLM Architecture Comparison (sebastianraschka.com)
3 points by Quizzical4230 3 months ago

Comprehensive ML/AI questions and answers for interview prep (sebastianraschka.com)
2 points by yaiml 4 months ago

PyTorch in One Hour: From Tensors to Training Neural Networks on Multiple GPUs (sebastianraschka.com)
4 points by sbbq 4 months ago

Intermediate ML and AI questions and answers for interview prep (sebastianraschka.com)
3 points by sbbq 4 months ago

Understanding and Coding the KV Cache in LLMs from Scratch (sebastianraschka.com)
6 points by sbbq 4 months ago

Understanding and Coding the KV Cache in LLMs from Scratch (sebastianraschka.com)
2 points by tosh 4 months ago

Coding LLMs from the Ground Up: A Complete Course (sebastianraschka.com)
4 points by sbbq 4 months ago

Coding LLMs from the Ground Up: A Complete Course (sebastianraschka.com)
2 points by mdp2021 6 months ago

The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com)
8 points by yaiml 6 months ago

The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com)
9 points by jonbaer 6 months ago

The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com)
4 points by mdp2021 6 months ago

The State of LLM Reasoning Models (sebastianraschka.com)
2 points by Philpax 7 months ago

The State of Reasoning Models (sebastianraschka.com)
4 points by sbbq 8 months ago

The State of LLM Reasoning Models Part 1: Inference-Time Compute Scaling Methods (sebastianraschka.com)
3 points by yaiml 8 months ago

Understanding Reasoning LLMs (sebastianraschka.com)
473 points by sebg 9 months ago | 183 comments

Understanding Reasoning LLMs (sebastianraschka.com)
4 points by sbbq 9 months ago

Noteworthy LLM Research Papers of 2024 Megapost (sebastianraschka.com)
5 points by yaiml 9 months ago