Strong Compute reposted this
Thrilled to share that our team — Ramprasadh Kumar, Bernett Orlando, and I — won the DeepSeek AI Fine-tuning Hackathon, hosted by Strong Compute. We developed a novel approach to fine-tune DeepSeek for a Math AI Tutor use case, enabling the transformation of natural language prompts into conceptual visualizations through Python-generated animations. We played around with a bunch of GRPO reward modeling strategies — super fun and a great mix of technical challenge and creative thinking. 😄