David Johnstone
arXiv.org on Hacker News
Recent submissions with ten or more points
175
LoRA+: Efficient Low Rank Adaptation of Large Models
44
28 Apr
13
Step Differences in Instructional Video
0
28 Apr
156
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
31
27 Apr
72
Relational Graph Convolutional Networks for Sentiment Analysis
3
26 Apr
29
One Bad Apple Can Spoil Your IPv6 Privacy
6
26 Apr
48
CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data
4
25 Apr
97
Quaternion Knowledge Graph Embeddings (2019)
40
25 Apr
135
Removing Reflections from RAW Photos
30
24 Apr
126
Claude 3 beats Google Translate
118
23 Apr
410
Phi-3 Technical Report
129
23 Apr
128
FPGA Architecture for Deep Learning: Survey and Future Directions
52
22 Apr
77
Survey Study on AI Agent Architectures (2024)
16
22 Apr
61
Many-Shot In-Context Learning
1
22 Apr
47
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
3
22 Apr
136
Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
23
21 Apr
44
Eight Transaction Papers by Jim Gray
9
19 Apr
124
Chinchilla Scaling: A replication attempt
68
18 Apr
92
Collapse of self-trained language models
30
17 Apr
88
The Ballmer Peak: An Empirical Search
24
17 Apr
167
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
28
16 Apr
124
ResearchAgent: Iterative Research Idea Generation Using LLMs
63
16 Apr
38
We have no idea how models will behave in production until production
3
15 Apr
29
ChatGPT Can Predict the Future Telling Stories Set in the Future About the Past
8
14 Apr
13
Mechanics of Next Token Prediction with Self-Attention
0
13 Apr
119
Your LLM Is a Capable Regressor When Given In-Context Examples
36
13 Apr
59
Fine-Tuning Increases LLM Vulnerabilities and Risk
33
11 Apr
14
Autonomous LLM agents with human-out-of-loop
8
11 Apr
39
Leave No Context Behind: Efficient Infinite Context Transformers
4
11 Apr
24
Toward Inference-Optimal Mixture-of-Expert Large Language Models
0
10 Apr
16
A Survey on Red Teaming for Generative Models
0
10 Apr
71
Evaluating faithfulness and content selection of LLMs in book-length summaries
7
9 Apr
21
AI consciousness is inevitable: A theoretical computer science perspective
54
9 Apr
104
Social Skill Training with Large Language Models
100
9 Apr
53
Apple Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
7
9 Apr
52
Direct Nash Optimization: Teaching language models to self-improve
11
8 Apr
71
Nightfall: Can Kalgash Exist (2014)
10
8 Apr
281
Mixture-of-Depths: Dynamically allocating compute in transformers
83
7 Apr
29
Rendering string diagrams recursively [pdf]
4
7 Apr
54
Sophia: Scalable Stochastic 2nd-Order Optimizer for Language Model Pre-Training
2
7 Apr
288
More Agents Is All You Need: LLMs performance scales with the number of agents
206
6 Apr
26
Long-form factuality in large language models
16
6 Apr
105
Language models are Super Mario: Absorbing abilities from homologous models
70
6 Apr
189
AI and the Problem of Knowledge Collapse
127
5 Apr
170
Language models as compilers: Simulating pseudocode execution
52
4 Apr
120
Rule-based NLP system beats LLM for analysis of psychiatric clinical notes
19
4 Apr
83
The Solution of the Zodiac Killer's 340-Character Cipher
4
3 Apr
91
Octopus v2: On-device language model for super agent
17
3 Apr
140
ReALM: Reference Resolution as Language Modeling
15
2 Apr
12
ReALM: Reference Resolution as Language Modeling
1
2 Apr
Powered by
hn.algolia.com