OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-15B.
-
Updated
May 10, 2024 - Python
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-15B.
pytorch implementation of several CNNs for image classification
Experimental GPT-2 scale (~124M param) LLM trained from scratch. Trained on 22B tokens od Cosmopedia Dataset. Includes full training pipeline, with SFT FineTuning and log analysis tools with backend and frontend and deployment
Add a description, image, and links to the train-from-scratch topic page so that developers can more easily learn about it.
To associate your repository with the train-from-scratch topic, visit your repo's landing page and select "manage topics."