Research Papers Archive



Continuous Thought Machines
Done

Luke Darlow, Ciaran Regan, Sebastian Risi +2 more

arXiv:2505.05522 5/8/2025

Biological brains demonstrate complex neural activity, where the timing and interplay between neurons is critical to how brains process information. Most deep learning architecture...

cs.LG
cs.AI
arXiv

AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts
Done

Taylor Shin, Yasaman Razeghi, Robert L. Logan IV +2 more

arXiv:2010.15980 10/29/2020

The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-bl...

cs.CL
cs.LG
arXiv

Specifications: The missing link to making the development of LLM systems an engineering discipline
Done

Ion Stoica, Matei Zaharia, Joseph Gonzalez +8 more

arXiv:2412.05299 11/25/2024

Despite the significant strides made by generative AI in just a few short years, its future progress is constrained by the challenge of building modular and robust systems. This ca...

cs.SE
cs.AI
cs.CL
arXiv