David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
168
LLMs get lost in multi-turn conversation
104
Today
252
Type-constrained code generation with language models
120
13 May
121
TransMLA: Multi-head latent attention is all you need
32
13 May
34
Toward a Sparse and Interpretable Audio Codec
2
12 May
107
Byte latent transformer: Patches scale better than tokens (2024)
22
12 May
88
Absolute Zero: Reinforced Self-Play Reasoning with Zero Data
18
11 May
19
Scoring the European Citizen in the AI Era
1
11 May
27
Human-Like Episodic Memory for Infinite Context LLMs
0
8 May
15
LLMs for Materials and Chemistry: 34 Real-World Examples
1
7 May
10
DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats
2
6 May
177
Analyzing Modern Nvidia GPU Cores
37
5 May
27
Why it is (nearly) impossible that we live in a simulation
82
5 May
45
Structuring Competency-Based Courses Through Skill Trees
26
5 May
230
Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs
53
4 May
10
Your ViT Is Secretly an Image Segmentation Model
0
4 May
91
A Survey of AI Agent Protocols
63
4 May
25
The Algebra of Patterns (Extended Version)
2
3 May
46
Stop treating `AGI' as the north-star goal of AI research
32
3 May
124
LLMs for Engineering: Teaching Models to Design High Powered Rockets
45
30 Apr
184
The Leaderboard Illusion
51
30 Apr
17
Beyond Performance: Measuring the environmental impact of analytical databases
4
29 Apr
94
Vision Transformers Need Registers
9
28 Apr
69
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
9
28 Apr
39
Do Large Language Models know who did what to whom?
5
27 Apr
411
Lossless LLM compression for efficient GPU inference via dynamic-length float
117
25 Apr
133
Paper2Code: Automating Code Generation from Scientific Papers
27
25 Apr
84
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch
8
24 Apr
71
Three things everyone should know about Vision Transformers
17
24 Apr
48
Should We Respect LLMs? A Study on Influence of Prompt Politeness on Performance
105
22 Apr
34
Quantum-assured magnetic navigation with higher positioning accuracy than GPS
9
22 Apr
40
Flat origami is Turing complete (2023)
12
22 Apr
26
Pydrofoil: Accelerating Sail-based instruction set simulators
2
21 Apr
70
Ultra-precision formation flying demonstration for space-based interferometry
23
21 Apr
95
Pushing the Limits of LLM Quantization via the Linearity Theorem
2
20 Apr
14
Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents
0
19 Apr
69
Inferring the Phylogeny of Large Language Models
6
19 Apr
71
CaMeL: Defeating Prompt Injections by Design
16
19 Apr
38
SDFs from Unoriented Point Clouds Using Neural Variational Heat Distances
5
18 Apr
111
BitNet b1.58 2B4T Technical Report
30
17 Apr
45
Eccfrog512ck2: An Enhanced 512-Bit Weierstrass Elliptic Curve [pdf]
16
16 Apr
21
Reasoning Models Can Be Effective Without Thinking
2
16 Apr
33
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
3
15 Apr
248
Teuken-7B-Base and Teuken-7B-Instruct: Towards European LLMs (2024)
95
15 Apr
30
All-in-Memory Stochastic Computing Using ReRAM
1
14 Apr
13
MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation
0
14 Apr
19
Our quantum assembly parser got updated to the QASM 3.0 spec
0
14 Apr
161
NoProp: Training neural networks without back-propagation or forward-propagation
49
14 Apr
25
Transfer between Modalities with MetaQueries
12
12 Apr
30
Visualizing a Million Time Series with the Density Line Chart
3
9 Apr
Powered by
hn.algolia.com