David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
42
Deep Learning Interviews (2021)
7
26 Jul
83
A Multimodal Automated Interpretability Agent
7
24 Jul
12
Domain-Aware Fine-Tuning of Foundation Models
0
23 Jul
62
Planck stars, White Holes, Remnants and Planck-mass quasi-particles
32
22 Jul
69
Satellite Drag Analysis During the May 2024 Geomagnetic Storm
7
21 Jul
190
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
69
17 Jul
10
Bringing Auto-Tuning to Hip: Comparing Performance Tuning on AMD and Nvidia
2
17 Jul
231
XLSTMTime: Long-Term Time Series Forecasting with xLSTM
53
16 Jul
66
Parametric Matrix Models
7
16 Jul
184
Large models of what? Mistaking engineering achievements for linguistic agency
156
16 Jul
65
Electra: Pre-Training Text Encoders as Discriminators Rather Than Generators (2020)
9
16 Jul
99
Transformer Layers as Painters
10
15 Jul
56
Lagrange: LAser GRavitational-wave ANtenna at GEo-lunar Lagrange points (2011)
17
15 Jul
11
New large value estimates for Dirichlet polynomials
1
14 Jul
307
Fitting an elephant with four non-zero parameters
147
14 Jul
39
Compact Fenwick trees for dynamic ranking and selection (2019)
5
14 Jul
12
Exploring the Limits of Transfer Learning with a Unified Transformer (2019)
1
13 Jul
108
WildGaussians: 3D Gaussian Splatting in the Wild
19
12 Jul
12
Fitting an Elephant with Four Non-Zero Parameters
1
12 Jul
27
Training a time series model using transformers at Datadog
0
11 Jul
130
A relativistic framework to establish coordinate time on the Moon and beyond
72
11 Jul
288
An abundance of Katherines: The game theory of baby naming
148
10 Jul
58
Dola Decoding by Contrasting Layers Improves Factuality in Large Language Models
43
10 Jul
142
Training of Physical Neural Networks
46
10 Jul
29
Grokking the Sequent Calculus (Functional Pearl)
1
9 Jul
389
C++ patterns for low-latency applications including high-frequency trading
231
8 Jul
10
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
0
8 Jul
214
Reasoning in Large Language Models: A Geometric Perspective
170
7 Jul
180
GPU-Friendly Stroke Expansion
39
2 Jul
18
MoonshotAI unveils Kimi's large-scale LLM serving architecture
1
2 Jul
17
Large language models have developed a higher-order theory of mind
4
2 Jul
81
Did Turing prove the undecidability of the halting problem?
105
2 Jul
165
Newswire: A large-scale structured database of a century of historical news
39
30 Jun
36
Edelman's Steps Toward a Conscious Artifact (2021)
40
29 Jun
101
Artificial needles to real haystacks: Improving retrieval capabilities in LLMs
21
29 Jun
79
Category theory using string diagrams (2014)
24
28 Jun
141
Computational Life: How self-replicating programs emerge from simple interaction
17
28 Jun
75
ELIZA Reinterpreted: The world's first chatbot was not intended as a chatbot
24
26 Jun
26
Indications of superconductivities in blend of variant apatite and covellite
20
26 Jun
16
Compressing graphs and indexes with recursive graph bisection (2016)
6
24 Jun
172
Deriving Dependently-Typed OOP from First Principles
18
23 Jun
104
SquirrelFS: Using the Rust compiler to check file-system crash consistency
10
23 Jun
48
The Origin of Jupiter's Great Red Spot
8
23 Jun
164
Delving into ChatGPT usage in academic writing through excess vocabulary
105
22 Jun
34
Q*: Improving Multi-Step Reasoning for LLMs with Deliberative Planning
3
21 Jun
36
There are no particles, there are only fields (2012)
35
20 Jun
11
RAR-B: Reasoning as Retrieval Benchmark
0
20 Jun
209
Refusal in language models is mediated by a single direction
44
18 Jun
180
TikTag: Breaking ARM's memory tagging extension with speculative execution
26
18 Jun
Powered by
hn.algolia.com