All Posts - Naifan Li's Blog

2026

Tmux 02-14

VAD 02-09

HuggingFace 02-03

2025

Scaling Law 12-25

LLaVA Mini 12-25

Instruct Tuning 12-25

Vision Language Adapter 12-24

2024

LLaMA 4: Next-Generation Open Language Models 12-25

Qwen2.5 Technical Report 12-19

Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution 09-25

Qwen2 Technical Report 07-15

AI-Powered Data-Centric System: Evolving from Data Closed-loop to World Simulation 06-16

2023

Qwen Technical Report 09-28

MetaCLIP: Demystifying CLIP Data 09-28

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond 08-25

LLaMA 2: Open Foundation and Fine-Tuned Chat Models 07-18

LIMA: Less Is More for Alignment 05-18

2022

Self-Instruct: Aligning Language Model with Self-Generated Instructions 12-20

Zero-shot-CoT: Large Language Models are Zero-Shot Reasoners 05-24

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models 05-21

Self-Consistency Improves Chain of Thought Reasoning in Language Models 03-21

InstructGPT 03-04

2021

FLAN: Finetuned Language Models Are Zero-Shot Learners 09-03

Summary: Sequence Packing 07-29

CLIP: Learning Transferable Visual Models From Natural Language Supervision 02-26

2020

GPT-3: Language Models are Few-Shot Learners 05-28

2019

T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer 10-23

GPT-2: Language Models are Unsupervised Multitask Learners 02-14

BBPE: Byte-level Byte Pair Encoding 02-14

2018

GPT-1: Improving Language Understanding by Generative Pre-Training 06-11

Conda 06-03

uv-pip 06-03

SSH 06-03

Character Encoding 06-03

2017

Transformer: Attention Is All You Need 06-12

Taxonomy of Natural Language Processing Tasks 06-12