David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
25
Talaria: Interactively Optimizing Machine Learning Models for Efficient Inferenc
1
9 Sep
37
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
1
9 Sep
44
Manipulating large language models to increase product visibility
19
6 Sep
258
Hardware Acceleration of LLMs: A comprehensive survey and comparison
68
6 Sep
66
Generalized Carlos Scales
8
4 Sep
38
Fibonacci Partial Sums Tricks
2
4 Sep
139
Keeping CALM: When distributed consistency is easy (2019)
27
3 Sep
59
Smaller, Weaker, yet Better: Training LLM Reasoners via Compute-Optimal Sampling
5
3 Sep
107
Inductive or deductive? Rethinking the fundamental reasoning abilities of LLMs
169
2 Sep
24
Radiance Cascades: A Novel High-Res Sol. For Multidim Non-LTE Radiative Transfer
7
31 Aug
23
Architectural Effects on Maximum Dependency Lengths of Recurrent Neural Networks
3
30 Aug
26
How big a table do you need for your jigsaw puzzle?
3
28 Aug
80
High-temperature Gibbs states are unentangled and efficiently preparable
33
28 Aug
58
Sapiens: Foundation for Human Vision Models
1
28 Aug
62
The Big Fringe Telescope
19
26 Aug
35
Realistic Synthetic UGC: A Scaffolding Approach to Generating Online Discussions
6
25 Aug
28
An exploration of Bluesky's public opening
46
23 Aug
38
StructuredRAG: JSON Response Formatting with Large Language Models
4
22 Aug
14
Profiling Programming Language Learning
0
20 Aug
118
Uniqueness Bias: Why it matters, how to curb it
67
18 Aug
116
DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model
47
17 Aug
28
Synthesizing Abstract Transformers for Reduced-Product Domains
0
16 Aug
165
Does Reasoning Emerge? Probabilities of Causation in Large Language Models
192
16 Aug
20
Galois Theory of Algorithms (2018) [pdf]
3
15 Aug
27
Are Emergent Abilities in Large Language Models Just In-Context Learning?
0
13 Aug
79
Tree Attention: Topology-Aware Decoding for Long-Context
22
11 Aug
198
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
107
11 Aug
56
Apple Intelligence Foundation Language Models
23
9 Aug
98
GPUDrive: Data-driven, multi-agent driving simulation at 1M FPS
10
8 Aug
233
Self-Compressing Neural Networks
57
4 Aug
118
LLM as Database Administrator (2023)
31
4 Aug
88
Non-computability of solutions of certain equations on digital computers (2022)
102
3 Aug
53
Prover-Verifier Games improve legibility of LLM outputs
3
3 Aug
111
Probability Estimates of a 21st Century AMOC Collapse
53
3 Aug
13
Where Are Large Language Models for Code Generation on GitHub?
7
2 Aug
66
Baidu's Improving Retrieval Augmented Language Model with Self-Reasoning
4
1 Aug
157
The Genomic Code: The genome instantiates a generative model of the organism
38
1 Aug
86
Deep-Tempest: Using Deep Learning to Eavesdrop on HDMI
15
31 Jul
208
Diffusion Training from Scratch on a Micro-Budget
27
30 Jul
55
Deep Learning Interviews (2021)
10
26 Jul
74
Dazed and Confused: A Large-Scale Real-World User Study of ReCAPTCHAv (2023)
53
24 Jul
83
A Multimodal Automated Interpretability Agent
7
24 Jul
12
Domain-Aware Fine-Tuning of Foundation Models
0
23 Jul
62
Planck stars, White Holes, Remnants and Planck-mass quasi-particles
32
22 Jul
70
Satellite Drag Analysis During the May 2024 Geomagnetic Storm
7
21 Jul
190
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
69
17 Jul
10
Bringing Auto-Tuning to Hip: Comparing Performance Tuning on AMD and Nvidia
2
17 Jul
231
XLSTMTime: Long-Term Time Series Forecasting with xLSTM
53
16 Jul
Powered by
hn.algolia.com