HN Reader

NewTopBestAskShowJob

Ask HN: What's a good format to submit CSV data for LLMs?

score icon2
comment icon4
2 days agoby JimsonYang
I need to submit like 1000 rows of data to an llm so I can ask it for trends within the data. If I use json, I check gpt tokenizer and thats like 40 tokens per row(cuz headers were being referenced everytime leading to inefficiency). Meaning 40k input, which definitely would put me in context rot(hallucination) territory. And I heard using csv was very inaccurate. Any suggetions