1Show HN: New LLM Pre-Training and Post-Training Paradigms (opens in new tab)(sebastianraschka.com)2rasbt1y ago0Save
2Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer) (opens in new tab)(youtube.com)Video43rasbt2y ago12Save
3Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama (opens in new tab)(github.com)GitHub2rasbt2y ago0Save
4Understanding the LLM Development Cycle: Building, Training, Finetuning (opens in new tab)(magazine.sebastianraschka.com)3rasbt2y ago0Save
5The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (opens in new tab)(magazine.sebastianraschka.com)5rasbt2y ago0Save
6Finetuning an LLM-Based Spam Classifier with LoRA from Scratch (opens in new tab)(github.com)GitHub14rasbt2y ago0Save
7Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes (opens in new tab)(github.com)GitHub3rasbt2y ago0Save
8Insights from Finetuning LLMs for Classification Tasks (opens in new tab)(github.com)GitHub2rasbt2y ago0Save
9Tips for LLM Pretraining and Evaluating Reward Models (opens in new tab)(sebastianraschka.com)2rasbt2y ago0Save
10Comparing 5 ways to implement Multihead Attention in PyTorch (opens in new tab)(github.com)GitHub3rasbt2y ago0Save
11AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (opens in new tab)(sebastianraschka.com)3rasbt2y ago0Save
13Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (opens in new tab)(magazine.sebastianraschka.com)96rasbt2y ago10Save
14AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (opens in new tab)(magazine.sebastianraschka.com)20rasbt2y ago0Save
15Implementing a ChatGPT-like LLM from scratch, step by step (opens in new tab)(github.com)GitHub739rasbt2y ago98Save