David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
Points | Title | Comments | Date
181 | DeepRAG: Thinking to retrieval step by step for large language models | 25 | 4 Feb
167 | Efficient Reasoning with Hidden Thinking | 41 | 3 Feb
82 | Reinforcement Learning: An Overview | 12 | 2 Feb
88 | Large Language Models for Mathematicians (2023) | 28 | 1 Feb
87 | Gradual Disempowerment: How Even Incremental AI Progress Poses Existential Risks | 84 | 1 Feb
107 | Theoretical limitations of multi-layer Transformer | 22 | 31 Jan
118 | Large language models think too fast to explore effectively | 41 | 31 Jan
224 | TopoNets: High performing vision and language models with brain-like topography | 67 | 31 Jan
134 | Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting | 32 | 29 Jan
52 | 3D scene reconstruction in adverse weather conditions via Gaussian splatting | 14 | 28 Jan
1349 | DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL | 1055 | 25 Jan
16 | Why are there Seven Sisters? | 8 | 24 Jan
89 | A Faster Quantum Fourier Transform | 6 | 23 Jan
219 | Foundations of Large Language Models | 20 | 23 Jan
151 | Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search | 6 | 22 Jan
117 | Flame: A small language model for spreadsheet formulas (2023) | 18 | 22 Jan
160 | Tensor Product Attention Is All You Need | 104 | 22 Jan
12 | Evolving Deeper LLM Thinking | 0 | 20 Jan
59 | ELIZA Reanimated | 18 | 18 Jan
58 | Mathematics of the daily word game Waffle | 19 | 17 Jan
161 | Titans: Learning to Memorize at Test Time | 35 | 15 Jan
19 | MathReader: Text-to-Speech for Mathematical Documents [pdf] | 0 | 14 Jan
115 | Titans: Learning to Memorize at Test Time | 15 | 13 Jan
229 | Learning how to think with Meta Chain-of-Thought | 75 | 10 Jan
39 | rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | 7 | 9 Jan
446 | Time-Series Anomaly Detection: A Decade Review | 80 | 6 Jan
42 | Did we miss P In CAP? Partial Progress Conjecture under Asynchrony | 4 | 5 Jan
133 | A path to O1 open source | 80 | 3 Jan
381 | Phase behavior of Cacio and Pepe sauce | 192 | 3 Jan
218 | TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023) | 104 | 2 Jan
23 | Identifying and Manipulating LLM Personality Traits via Activation Engineering | 9 | Dec 2024
96 | Beyond Gradient Averaging in Parallel Optimization | 41 | Dec 2024
81 | How Well Do LLMs Generate Code for Different Application Domains? | 25 | Dec 2024
16 | Gamma-ray bursts: what do we know today that we did not know 10 years ago? | 0 | Dec 2024
236 | 4.5M Suspected Fake Stars in GitHub | 212 | Dec 2024
40 | Empirical Study of Test Generation with LLM's | 36 | Dec 2024
21 | Measuring and Understanding LLM Identity Confusion | 1 | Dec 2024
89 | Explaining Large Language Models Decisions Using Shapley Values | 19 | Dec 2024
47 | Invariants: Computation and Applications | 1 | Dec 2024
107 | Supernovae Evidence for Foundational Change to Cosmological Models | 74 | Dec 2024
306 | Adversarial policies beat superhuman Go AIs (2023) | 139 | Dec 2024
111 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | 9 | Dec 2024
117 | Tokenisation Is NP-Complete | 24 | Dec 2024
291 | Compiling C to Safe Rust, Formalized | 157 | Dec 2024
19 | Lightweight Safety Classification Using Pruned Language Models | 3 | Dec 2024
11 | Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation | 1 | Dec 2024
166 | Classical sorting algorithms as a model of morphogenesis (2023) | 79 | Dec 2024
246 | Cultural Evolution of Cooperation Among LLM Agents | 131 | Dec 2024
91 | No More Adam: Learning Rate Scaling at Initialization Is All You Need | 28 | Dec 2024
Powered by hn.algolia.com