David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
477
Were RNNs all we needed?
247
3 Oct
238
Serving 70B-scale LLMs efficiently on low-resource edge devices [pdf]
58
3 Oct
31
A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs
9
1 Oct
179
On the design of text editors (2020)
82
28 Sep
250
Collaborative text editing with Eg-Walker: Better, faster, smaller
30
27 Sep
124
LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs
29
27 Sep
159
Automatic Content Recognition Tracking in Smart TVs
155
26 Sep
43
The Impact of Element Ordering on LM Agent Performance
2
24 Sep
44
NeRF-Supervised Feature Point Detection and Description
1
23 Sep
51
AI Companions Reduce Loneliness
81
21 Sep
42
Dissociating language and thought in large language models
4
21 Sep
10
SwiGLU activation function causes instability in FP8 LLM training
2
21 Sep
230
Training Language Models to Self-Correct via Reinforcement Learning
92
20 Sep
73
A rigid but foldable indoor airship aerial system for cave exploration
45
20 Sep
20
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
3
20 Sep
32
Some remarks on the mathematical structure of the multiverse (2016)
21
17 Sep
33
What Is Entropy?
5
17 Sep
261
Chain of Thought empowers transformers to solve inherently serial problems
184
17 Sep
291
LLMs Will Always Hallucinate, and We Need to Live with This
261
14 Sep
139
The Legend of Holy Sword: An Immersive Experience for Concentration Enhancement
66
13 Sep
221
Tutorial on diffusion models for imaging and vision
18
10 Sep
80
Pixhell Attack: Leaking Info from Air-Gap Computers via 'Singing Pixels'
30
10 Sep
38
ChartEye: A Deep Learning Framework for Chart Information Extraction
0
10 Sep
80
Deductive Verification for Chain-of-Thought Reasoning in LLMs
20
10 Sep
41
Talaria: Interactively Optimizing Machine Learning Models for Efficient Inferenc
6
9 Sep
40
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
1
9 Sep
45
Manipulating large language models to increase product visibility
19
6 Sep
266
Hardware Acceleration of LLMs: A comprehensive survey and comparison
68
6 Sep
66
Generalized Carlos Scales
8
4 Sep
38
Fibonacci Partial Sums Tricks
2
4 Sep
139
Keeping CALM: When distributed consistency is easy (2019)
27
3 Sep
59
Smaller, Weaker, yet Better: Training LLM Reasoners via Compute-Optimal Sampling
5
3 Sep
107
Inductive or deductive? Rethinking the fundamental reasoning abilities of LLMs
169
2 Sep
24
Radiance Cascades: A Novel High-Res Sol. For Multidim Non-LTE Radiative Transfer
7
31 Aug
23
Architectural Effects on Maximum Dependency Lengths of Recurrent Neural Networks
3
30 Aug
26
How big a table do you need for your jigsaw puzzle?
3
28 Aug
80
High-temperature Gibbs states are unentangled and efficiently preparable
33
28 Aug
58
Sapiens: Foundation for Human Vision Models
1
28 Aug
62
The Big Fringe Telescope
19
26 Aug
35
Realistic Synthetic UGC: A Scaffolding Approach to Generating Online Discussions
6
25 Aug
28
An exploration of Bluesky's public opening
46
23 Aug
38
StructuredRAG: JSON Response Formatting with Large Language Models
4
22 Aug
14
Profiling Programming Language Learning
0
20 Aug
118
Uniqueness Bias: Why it matters, how to curb it
67
18 Aug
116
DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model
47
17 Aug
28
Synthesizing Abstract Transformers for Reduced-Product Domains
0
16 Aug
Powered by
hn.algolia.com