David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
Why are there Seven Sisters? (16 points, 6 comments, 24 Jan)
Foundations of Large Language Models (213 points, 20 comments, 23 Jan)
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search (149 points, 6 comments, 22 Jan)
Flame: A small language model for spreadsheet formulas (2023) (116 points, 18 comments, 22 Jan)
Tensor Product Attention Is All You Need (158 points, 103 comments, 22 Jan)
Evolving Deeper LLM Thinking (12 points, 0 comments, 20 Jan)
ELIZA Reanimated (58 points, 18 comments, 18 Jan)
Mathematics of the daily word game Waffle (58 points, 19 comments, 17 Jan)
Titans: Learning to Memorize at Test Time (161 points, 35 comments, 15 Jan)
MathReader: Text-to-Speech for Mathematical Documents [pdf] (19 points, 0 comments, 14 Jan)
Titans: Learning to Memorize at Test Time (115 points, 15 comments, 13 Jan)
Learning how to think with Meta Chain-of-Thought (229 points, 75 comments, 10 Jan)
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking (39 points, 7 comments, 9 Jan)
Time-Series Anomaly Detection: A Decade Review (446 points, 80 comments, 6 Jan)
Did we miss P In CAP? Partial Progress Conjecture under Asynchrony (42 points, 4 comments, 5 Jan)
A path to O1 open source (133 points, 80 comments, 3 Jan)
Phase behavior of Cacio and Pepe sauce (381 points, 192 comments, 3 Jan)
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023) (218 points, 104 comments, 2 Jan)
Identifying and Manipulating LLM Personality Traits via Activation Engineering (23 points, 9 comments, Dec 2024)
Beyond Gradient Averaging in Parallel Optimization (96 points, 41 comments, Dec 2024)
How Well Do LLMs Generate Code for Different Application Domains? (81 points, 25 comments, Dec 2024)
Gamma-ray bursts: what do we know today that we did not know 10 years ago? (16 points, 0 comments, Dec 2024)
4.5M Suspected Fake Stars in GitHub (236 points, 212 comments, Dec 2024)
Empirical Study of Test Generation with LLM's (40 points, 36 comments, Dec 2024)
Measuring and Understanding LLM Identity Confusion (21 points, 1 comment, Dec 2024)
Explaining Large Language Models Decisions Using Shapley Values (89 points, 19 comments, Dec 2024)
Invariants: Computation and Applications (47 points, 1 comment, Dec 2024)
Supernovae Evidence for Foundational Change to Cosmological Models (107 points, 74 comments, Dec 2024)
Adversarial policies beat superhuman Go AIs (2023) (306 points, 139 comments, Dec 2024)
Offline Reinforcement Learning for LLM Multi-Step Reasoning (111 points, 9 comments, Dec 2024)
Tokenisation Is NP-Complete (117 points, 24 comments, Dec 2024)
Compiling C to Safe Rust, Formalized (291 points, 157 comments, Dec 2024)
Lightweight Safety Classification Using Pruned Language Models (19 points, 3 comments, Dec 2024)
Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation (11 points, 1 comment, Dec 2024)
Classical sorting algorithms as a model of morphogenesis (2023) (166 points, 79 comments, Dec 2024)
Cultural Evolution of Cooperation Among LLM Agents (246 points, 131 comments, Dec 2024)
No More Adam: Learning Rate Scaling at Initialization Is All You Need (91 points, 28 comments, Dec 2024)
Best-of-N Jailbreaking (68 points, 15 comments, Dec 2024)
Best-of-N Jailbreaking (17 points, 1 comment, Dec 2024)
Frontier Models are Capable of In-context Scheming (10 points, 1 comment, Dec 2024)
Reachability Analysis of DNS (12 points, 0 comments, Dec 2024)
Cyborg Insect Factory (21 points, 5 comments, Dec 2024)
Training LLMs to Reason in a Continuous Latent Space (283 points, 114 comments, Dec 2024)
Confidential Computing Platform Based on Tee and TPM Collaborative Trust (14 points, 1 comment, Dec 2024)
The Hexagonal Tiling Honeycomb (44 points, 9 comments, Dec 2024)
Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning (11 points, 1 comment, Dec 2024)
Optimality of Gerver's Sofa (131 points, 36 comments, Dec 2024)
Procedural knowledge in pretraining drives reasoning in large language models (248 points, 101 comments, Dec 2024)
DynaSaur: Large Language Agents Beyond Predefined Actions (128 points, 31 comments, Dec 2024)
Powered by hn.algolia.com