Posts

Llama 3.2 (3B) - New Sizes for Different Needs

The Llama 3.2 family of models released by Meta AI focuses on two key use cases: vision LLMs (11B and 90B) and edge/small LLMs (1B and 3B). With good-looking benchmarks, how good can a local LLM be?

Oct 05, 2024

Better Context for your RAG with Contextual Retrieval

What if your chunks in RAG were providing better context? Let's explore how to use contextual retrieval to enhance your RAG systems.

Sep 28, 2024

Flux.1-Dev - Photorealistic (and Cute) Images

Did you have trouble making great images with AI? Maybe you were using the wrong model.

Sep 01, 2024

GPT-4o API Deep Dive: Text Generation, Vision, and Function Calling

The latest OpenAI model, GPT-4o, is here! What are the new features and improvements?

May 21, 2024

Llama 3 - Open Model That Is Truly Useful?

The third iteration of the Llama model is here! How does it compare to the previous versions? What are the new features and improvements? Let's find out!

Apr 28, 2024

Run Your Own AI (Mixtral) on Your Machine

Discover how hosting Mixtral, an open-source large language model, on a cloud instance can help you own your AI.

Apr 14, 2024

Experiment Tracking with DVC

Running experiments and doing hyperparameter tuning is a crucial part of machine learning. In this post, we will see how to use DVC to track experiments and manage the model files.

Apr 7, 2024

MemGPT - Unlimited Context (Memory) for LLMs

One of the largest (no pun intended) limitations of Large Language Models (LLMs) is their context window size. How can you overcome the limited token context window?

Dec 30, 2023

Chat with Multiple PDFs using Llama 2 and LangChain

Can you build a chatbot that can answer questions from multiple PDFs? Can you do it with a private LLM?

Dec 28, 2023

AutoGen - Build Powerful AI Agents with ChatGPT/GPT-4

We'll explore AutoGen, a Microsoft library that lets you create LLM applications with agents. These agents can communicate and help you solve complex tasks.

Dec 27, 2023

CryptoGPT: Crypto Twitter Sentiment Analysis

In this tutorial, we'll dive into a fascinating project that combines Streamlit, ChatGPT, and LangChain to analyze the sentiment of tweets related to cryptocurrencies.

Dec 26, 2023

LLaVA - Large Multimodal Model

Large Language Models (LLMs) allow us to generate text, but they only take text as an input. Large Multimodal Models (LMM) can take both text and image as an input, and generate text based on both.

Oct 27, 2023

LangChain QuickStart with Llama 2

LangChain helps you to tackle a significant limitation of LLMs—utilizing external data and tools. Getting started is a breeze. Let's dive in!

Sep 24, 2023

Fine-tuning Llama 2 on a Custom Dataset

Can you make LLMs work better for your specific task? Yes, you can! In this tutorial, you'll learn how to fine-tune Llama 2 on a custom dataset using the QLoRA technique

Sep 6, 2023

Deploy Your Private Llama 2 Model to Production with RunPod

In this tutorial, you'll learn the steps to deploy your very own Llama 2 instance and set it up for private use using the RunPod cloud platform.

Jul 31, 2023

Private Chatbot with Local LLM (Falcon 7B) and LangChain

Can you build a private Chatbot with ChatGPT-like performance using a local LLM on a single GPU? We'll use Falcon 7B with LangChain to build a chatbot!

Jul 18, 2023

Support Chatbot using Custom Knowledge Base with LangChain and Open LLM

Can you build a chatbot that can answer questions about your product or service? What if you could use your existing knowledge base to train the chatbot?

Jul 11, 2023

Deploy LLM to Production with HuggingFace Inference Endpoints

How to deploy a fine-tuned Falcon 7B LLM with QLoRA to production? After training Falcon 7B with QLoRA on a custom dataset, the next step is deploying the model to production.

Jun 22, 2023

Faster LLM Inference

Can you optimize the inference time of your LLM? How? Let's explore strategies to enhance the inference speed of your LLM.

Jun 11, 2023

Fine-tuning Large Language Model (LLM) on a Custom Dataset with QLoRA

Can you train your own LLM using your own data? Can you accomplish this without sharing your data with third-party companies or APIs?

Jun 4, 2023

Private GPT4All: Chat with PDF Files Using Free LLM

Have concerns about data privacy while using ChatGPT? Want an alternative to cloud-based language models that is both powerful and free? Look no further than GPT4All.

May 20, 2023

Exploring the ChatGPT API with Python

Learn how to use OpenAI's ChatGPT API to generate responses to prompts. This tutorial covers various features of the API, including token limits, multiple completions, and temperature control.

Apr 27, 2023

Fine-tuning Alpaca and LLaMA: Training on a Custom Dataset

In this tutorial, we will explore the process of fine-tuning Alpaca LoRa for detecting sentiment in Bitcoin tweets.

Apr 10, 2023

Alpaca and LLaMA: Inference and Evaluation

Can we achieve ChatGPT-like performance by fine-tuning a smaller model? Welcome to the tutorial on how to use the Stanford Alpaca model for conversational AI.

Apr 10, 2023

Document Classification with LayoutLMv3

We'll explore the task of document classification using layout information and image content. We will use the LayoutLMv3 model, a state-of-the-art model for this task, and PyTorch Lightning, a lightweight PyTorch wrapper for high-performance training.

Mar 24, 2023