HN Reader
New
Top
Best
Ask
Show
Job
Refrag: Rethinking RAG Based Decoding
2
1
4 hours ago
by datadrivenangel
Am I misunderstanding this or is basically just taking RAG results and doing a vector search on the results and only passing some to the context window?
Also, why do these AI papers never get speedup times in human time units?
4 hours ago
by datadrivenangel