feat: Add FLAN-T5 compatibility with relative position bias b43ba56 OliverPerrin committed 18 days ago
Refactor: Consolidate dependencies, improve testing, and add CI/CD d18b34d OliverPerrin committed 19 days ago
Implemented the following parts of the Transformer model: positional_encoding, feedforward, encoder, and a skeleton of the decoder, along with test cases for each class and visualizations 5a20c96 OliverPerrin committed on Oct 29
Implemented ScaledDotProduct Attention and Multi-Head Attention 204fb3c OliverPerrin committed on Oct 27