In a Training Loop 🔄
65.1 TFLOPS
Piyush Maharana
catastropiyush
20 followers · 75 following
https://catastropiyush.github.io/
catastropiyush
AI & ML interests
LLMs for scientific data extraction, Solid State Hydrogen Storage, Machine Learning
Recent Activity
upvoted an article · 1 day ago
An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs
upvoted an article · 6 days ago
BERTs that chat: turn any BERT into a chatbot with dLLM
commented on an article · 28 days ago
TorchSim: A new PyTorch-based molecular dynamics engine
catastropiyush's models (19)
catastropiyush/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated May 6 • 8
catastropiyush/gemma-3-finetune_GRPO • Text Generation • 1.0B • Updated May 5 • 7
catastropiyush/llama3_1_GRPO • Updated Feb 7
catastropiyush/SmolLM2-FT-MyDataset • Text Generation • 0.1B • Updated Jan 22 • 9
catastropiyush/Reinforce_Pixelcopter • Updated Jan 21
catastropiyush/Reinfroce_model • Reinforcement Learning • Updated Jan 21
catastropiyush/llama_3_2_vision • Updated Jan 7
catastropiyush/llama-3.2-3B_instruct_Q4_K_M • 3B • Updated Nov 8, 2024 • 12
catastropiyush/Marcoro14-7B-slerp • Updated Jun 5, 2024
catastropiyush/Llama-3_8b_Alpaca • Text Generation • Updated Jun 5, 2024 • 8
catastropiyush/Taxi • Reinforcement Learning • Updated Jun 2, 2024
catastropiyush/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jun 2, 2024
catastropiyush/Mistral_7B_shell • Updated May 14, 2024
catastropiyush/idefics2-8b-docvqa-finetuned-tutorial • Updated May 10, 2024
catastropiyush/llama-3_8b_Q5_K_M • 8B • Updated May 1, 2024 • 28
catastropiyush/moondream_finetune_example • Text Generation • 2B • Updated Apr 22, 2024 • 9
catastropiyush/Alpaca_Mistral_finetune_GGUF • 7B • Updated Apr 20, 2024 • 11
catastropiyush/ppo-Huggy • Reinforcement Learning • Updated Apr 2, 2024 • 75
catastropiyush/ppo-LunarLander-v2 • Reinforcement Learning • Updated Mar 12, 2024 • 2