Lightnews — Scholar-powered news

Chip Huyen

@chiphuyen.bsky.social

5.8K followers 42 following 14 posts

AI x storytelling
AI Engineering: https://amazon.com/dp/1098166302
Designing ML Systems: http://amazon.com/dp/1098107969
@chipro

Posts Replies Media Videos

Chip Huyen

@chiphuyen.bsky.social

O'Reilly said the first physical copies would appear around Dec 22 but my copies arrive on Jan 7 :(

December 13, 2024 at 7:37 AM

Chip Huyen

@chiphuyen.bsky.social

6. AI Incident Database

For those interested in seeing how AI can go wrong, this contains over 3000 reports of AI harms: incidentdatabase.ai

Welcome to the Artificial Intelligence Incident Database

The starting point for information about the AI Incident Database

incidentdatabase.ai

December 13, 2024 at 4:06 AM

Chip Huyen

@chiphuyen.bsky.social

5. Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models (Lu et al., 2023)

A cool study on LLM planners, how they use tools, and their failure modes. An interesting finding is that different LLMs have different tool preferences: arxiv.org/abs/2304.09842

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models

Large language models (LLMs) have achieved remarkable progress in solving various natural language processing tasks due to emergent reasoning abilities. However, LLMs have inherent limitations as they...

arxiv.org

December 13, 2024 at 4:06 AM

Chip Huyen

@chiphuyen.bsky.social

4. Efficiently Scaling Transformer Inference (Pope et al., 2022)

An amazing paper about inference optimization for transformers. It provides a guideline to optimize for different aspects, e.g. lowest possible latency, highest possible throughput, or longest context length: arxiv.org/abs/2211.05102

Efficiently Scaling Transformer Inference

We study the problem of efficient generative inference for Transformer models, in one of its most challenging settings: large deep models, with tight latency targets and long sequence lengths. Better ...

arxiv.org

December 13, 2024 at 4:05 AM

Chip Huyen

@chiphuyen.bsky.social

3. Llama 3 paper

The section on post-training data is a gold mine! It details different techniques they used to generate 2.7M examples for instruction finetuning. It also covers synthetic data verification! arxiv.org/abs/2407.21783

The Llama 3 Herd of Models

Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support ...

arxiv.org

December 13, 2024 at 4:05 AM

Chip Huyen

@chiphuyen.bsky.social

2. OpenAI’s best practices for finetuning

While this guide focuses on GPT-3, many techniques are applicable to finetuning in general. It explains how finetuning works, how to prepare training data, how to pick hyperparameters, and common finetuning mistakes: docs.google.com/document/d/1...

[PUBLIC] Best practices for fine-tuning GPT-3 to classify text

This document is a draft of a guide that will be added to a future revision of the OpenAI documentation. If you have any feedback, feel free to let us know. One note: this doc shares metrics for text...

docs.google.com

December 13, 2024 at 4:05 AM

Chip Huyen

@chiphuyen.bsky.social

The highlights:

1. Anthropic’s Prompt Engineering Interactive Tutorial

The Google Sheets-based interactive exercises make it easy to experiment with different prompts. docs.google.com/spreadsheets...

Anthropic's Prompt Engineering Interactive Tutorial [PUBLIC ACCESS]

docs.google.com

December 13, 2024 at 4:05 AM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news