Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated a model about 1 hour ago
ricdomolm/talkie-web-coder updated a model about 2 hours ago
ricdomolm/talkie-1930-coder updated a collection about 2 hours ago
1930 CoderOrganizations
1930 Coder
Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Computational Arbitrage
Models and datasets for the paper "Computational Arbitrage in AI Model Markets"
mini-coder
Small models for agentic SWE research: https://ricardodominguez.github.io/blogs/minicoder.html
Training on the test task models
Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890