In a Training Loop 🔄

Piyush Maharana

catastropiyush

https://catastropiyush.github.io/

AI & ML interests

LLMs for scientific data extraction, Solid State Hydrogen Storage, Machine Learning

Recent Activity

upvoted an article 1 day ago

An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs

upvoted an article 6 days ago

BERTs that chat: turn any BERT into a chatbot with dLLM

commented on an article 28 days ago

TorchSim: A new PyTorch-based molecular dynamics engine

catastropiyush's models (19)

catastropiyush/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated May 6 • 8

catastropiyush/gemma-3-finetune_GRPO

Text Generation • 1.0B • Updated May 5 • 7

catastropiyush/llama3_1_GRPO

catastropiyush/SmolLM2-FT-MyDataset

Text Generation • 0.1B • Updated Jan 22 • 9

catastropiyush/Reinforce_Pixelcopter

catastropiyush/Reinfroce_model

Reinforcement Learning • Updated Jan 21

catastropiyush/llama_3_2_vision

catastropiyush/llama-3.2-3B_instruct_Q4_K_M

3B • Updated Nov 8, 2024 • 12

catastropiyush/Marcoro14-7B-slerp

Updated Jun 5, 2024

catastropiyush/Llama-3_8b_Alpaca

Text Generation • Updated Jun 5, 2024 • 8

catastropiyush/Taxi

Reinforcement Learning • Updated Jun 2, 2024

catastropiyush/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jun 2, 2024

catastropiyush/Mistral_7B_shell

Updated May 14, 2024

catastropiyush/idefics2-8b-docvqa-finetuned-tutorial

Updated May 10, 2024

catastropiyush/llama-3_8b_Q5_K_M

8B • Updated May 1, 2024 • 28

catastropiyush/moondream_finetune_example

Text Generation • 2B • Updated Apr 22, 2024 • 9

catastropiyush/Alpaca_Mistral_finetune_GGUF

7B • Updated Apr 20, 2024 • 11

catastropiyush/ppo-Huggy

Reinforcement Learning • Updated Apr 2, 2024 • 75

catastropiyush/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 12, 2024 • 2