rasbt on Hacker News

1

Show HN: New LLM Pre-Training and Post-Training Paradigms (opens in new tab)

(sebastianraschka.com)

2rasbt1y ago0

2

Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer) (opens in new tab)

(youtube.com)Video

43rasbt2y ago12

3

Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama (opens in new tab)

(github.com)GitHub

2rasbt2y ago0

4

Understanding the LLM Development Cycle: Building, Training, Finetuning (opens in new tab)

(magazine.sebastianraschka.com)

3rasbt2y ago0

5

The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (opens in new tab)

(magazine.sebastianraschka.com)

5rasbt2y ago0

6

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch (opens in new tab)

(github.com)GitHub

14rasbt2y ago0

7

Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes (opens in new tab)

(github.com)GitHub

3rasbt2y ago0

8

Insights from Finetuning LLMs for Classification Tasks (opens in new tab)

(github.com)GitHub

2rasbt2y ago0

9

Tips for LLM Pretraining and Evaluating Reward Models (opens in new tab)

(sebastianraschka.com)

2rasbt2y ago0

10

Comparing 5 ways to implement Multihead Attention in PyTorch (opens in new tab)

(github.com)GitHub

3rasbt2y ago0

11

AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (opens in new tab)

(sebastianraschka.com)

3rasbt2y ago0

12

Understanding, using, and finetuning Gemma (opens in new tab)

(lightning.ai)

118rasbt2y ago48

13

Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (opens in new tab)

(magazine.sebastianraschka.com)

96rasbt2y ago10

14

AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (opens in new tab)

(magazine.sebastianraschka.com)

20rasbt2y ago0

15

Implementing a ChatGPT-like LLM from scratch, step by step (opens in new tab)

(github.com)GitHub

739rasbt2y ago98

rasbt

Recent submissions

Show HN: New LLM Pre-Training and Post-Training Paradigms (opens in new tab)

Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer) (opens in new tab)

Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama (opens in new tab)

Understanding the LLM Development Cycle: Building, Training, Finetuning (opens in new tab)

The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (opens in new tab)

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch (opens in new tab)

Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes (opens in new tab)

Insights from Finetuning LLMs for Classification Tasks (opens in new tab)

Tips for LLM Pretraining and Evaluating Reward Models (opens in new tab)

Comparing 5 ways to implement Multihead Attention in PyTorch (opens in new tab)

AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (opens in new tab)

Understanding, using, and finetuning Gemma (opens in new tab)

Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (opens in new tab)

AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (opens in new tab)

Implementing a ChatGPT-like LLM from scratch, step by step (opens in new tab)

Recent submissions

Show HN: New LLM Pre-Training and Post-Training Paradigms (opens in new tab)

Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer) (opens in new tab)

Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama (opens in new tab)

Understanding the LLM Development Cycle: Building, Training, Finetuning (opens in new tab)

The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (opens in new tab)

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch (opens in new tab)

Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes (opens in new tab)

Insights from Finetuning LLMs for Classification Tasks (opens in new tab)

Tips for LLM Pretraining and Evaluating Reward Models (opens in new tab)

Comparing 5 ways to implement Multihead Attention in PyTorch (opens in new tab)

AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (opens in new tab)

Understanding, using, and finetuning Gemma (opens in new tab)

Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (opens in new tab)

AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (opens in new tab)

Implementing a ChatGPT-like LLM from scratch, step by step (opens in new tab)