David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
Points | Title | Comments | Date
10 | Frontier AI systems have surpassed the self-replicating red line | 3 | 10 Feb
79 | Scaling up test-time compute with latent reasoning: A recurrent depth approach | 18 | 10 Feb
170 | PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models | 78 | 9 Feb
378 | LIMO: Less Is More for Reasoning | 126 | 9 Feb
17 | Frontier AI systems have surpassed the self-replicating red line | 5 | 9 Feb
11 | Demystifying Long Chain-of-Thought Reasoning in LLMs | 0 | 8 Feb
15 | Bolt: Bootstrap long chain-of-thought in LLMs without distillation [pdf] | 5 | 8 Feb
68 | Value-Based Deep RL Scales Predictably | 3 | 8 Feb
14 | Competition and survival in modern academia: A bibliometric case study | 3 | 7 Feb
62 | Gold-Medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | 5 | 7 Feb
65 | HippoRAG: Neurobiologically Inspired Long-Term Memory for LLMs (2024) | 4 | 7 Feb
139 | Robust autonomy emerges from self-play | 61 | 7 Feb
147 | Pre-Trained Large Language Models Use Fourier Features for Addition (2024) | 40 | 6 Feb
11 | Digital Agent outperforms o1 by 15% – trained with new RL-variant similar to R1 | 0 | 5 Feb
191 | DeepRAG: Thinking to retrieval step by step for large language models | 29 | 4 Feb
172 | Efficient Reasoning with Hidden Thinking | 43 | 3 Feb
82 | Reinforcement Learning: An Overview | 12 | 2 Feb
89 | Large Language Models for Mathematicians (2023) | 28 | 1 Feb
87 | Gradual Disempowerment: How Even Incremental AI Progress Poses Existential Risks | 84 | 1 Feb
107 | Theoretical limitations of multi-layer Transformer | 22 | 31 Jan
118 | Large language models think too fast to explore effectively | 41 | 31 Jan
225 | TopoNets: High performing vision and language models with brain-like topography | 68 | 31 Jan
137 | Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting | 32 | 29 Jan
53 | 3D scene reconstruction in adverse weather conditions via Gaussian splatting | 14 | 28 Jan
1351 | DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL | 1056 | 25 Jan
16 | Why are there Seven Sisters? | 8 | 24 Jan
89 | A Faster Quantum Fourier Transform | 6 | 23 Jan
219 | Foundations of Large Language Models | 20 | 23 Jan
151 | Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search | 6 | 22 Jan
117 | Flame: A small language model for spreadsheet formulas (2023) | 18 | 22 Jan
160 | Tensor Product Attention Is All You Need | 104 | 22 Jan
12 | Evolving Deeper LLM Thinking | 0 | 20 Jan
59 | ELIZA Reanimated | 18 | 18 Jan
58 | Mathematics of the daily word game Waffle | 19 | 17 Jan
161 | Titans: Learning to Memorize at Test Time | 35 | 15 Jan
19 | MathReader: Text-to-Speech for Mathematical Documents [pdf] | 0 | 14 Jan
115 | Titans: Learning to Memorize at Test Time | 15 | 13 Jan
229 | Learning how to think with Meta Chain-of-Thought | 75 | 10 Jan
39 | rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | 7 | 9 Jan
446 | Time-Series Anomaly Detection: A Decade Review | 80 | 6 Jan
42 | Did we miss P In CAP? Partial Progress Conjecture under Asynchrony | 4 | 5 Jan
133 | A path to O1 open source | 80 | 3 Jan
381 | Phase behavior of Cacio and Pepe sauce | 192 | 3 Jan
218 | TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023) | 104 | 2 Jan
23 | Identifying and Manipulating LLM Personality Traits via Activation Engineering | 9 | Dec 2024
96 | Beyond Gradient Averaging in Parallel Optimization | 41 | Dec 2024
81 | How Well Do LLMs Generate Code for Different Application Domains? | 25 | Dec 2024
16 | Gamma-ray bursts: what do we know today that we did not know 10 years ago? | 0 | Dec 2024
236 | 4.5M Suspected Fake Stars in GitHub | 212 | Dec 2024