Learn and Burn
Subscribe
Sign in
Home
Archive
About
Latest
Top
Discussions
Measuring a model's understanding — starting with path-finding
[Paper: Evaluating the World Model Implicit in a Generative Model]
Nov 26, 2024
•
Unbox Research
1
Making LLMs scalable by replacing weights with learnable tokens
[Paper: Tokenformer: Rethinking Transformer Scaling with Tokenized Model Parameters]
Nov 19, 2024
•
Unbox Research
2
Image generation for infinite games
[Paper: Unbounded: A Generative Infinite Game of Character Life Simulation]
Nov 10, 2024
•
Unbox Research
1
Do LLMs rely on data contamination to solve math problems?
[Paper: GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models]
Nov 2, 2024
•
Unbox Research
1
1
October 2024
Running an LLM on a small customizable chip
[Paper: LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs]
Oct 30, 2024
•
Unbox Research
2
Better language models with negative attention
[Paper: Differential Transformer]
Oct 18, 2024
•
Unbox Research
2
A serious look at the future of AI medical advice
[Paper: A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?]
Oct 12, 2024
•
Unbox Research
1
LLMs have original, research-worthy ideas
[Paper: Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers]
Oct 8, 2024
•
Unbox Research
2
September 2024
OpenAI's o1 model
[Article: Learning to Reason with LLMs]
Sep 27, 2024
•
Unbox Research
1
The subgoals of attention units in LLMs
[Paper: Attention Heads of Large Language Models: A Survey]
Sep 20, 2024
•
Unbox Research
1
A model to parse body language
[Paper: Sapiens: Foundation Model for Human Vision Models]
Sep 16, 2024
•
Unbox Research
1
Stable diffusion can simulate video games
[Paper: Diffusion Models are Real-Time Game Engines]
Sep 6, 2024
•
Unbox Research
1
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts