Archives
- 21 Apr Speculative Decoding - Making Language Models Generate Faster Without Losing Their Minds
- 16 Mar Mixture of Experts – Scaling Transformers Without Breaking the FLOPS Bank
- 08 Feb Doing MORE to Consume LESS – Flash Attention V1
- 04 Jan Guidance – Structuring your outputs is easier than you think
- 23 Dec A beginner's guide to Vision Language Models (VLMs)
- 21 Nov KV cache – The how not to waste your FLOPS starter
- 11 Nov Attention Scores, Scaling, and Softmax
- 01 Nov The Hidden Beauty of Sinusoidal Positional Encodings in Transformers
- 28 Oct Vanishing and Exploding Gradients – A Non-Flat-Earther's Perspective
- 25 Oct Teaching an AI to Drive a Taxi – A Friendly Guide to Q-Learning
- 22 Oct Recall and Precision – A Practical Case Against Memorization