
Setting Up Ollama for Hands-On AI

This guide will walk you through the simple process of setting up Ollama on your local machine to power your Hands-On AI projects.

🧭 Who Is This Guide For?

This guide is written for beginners who want to run Hands-On AI with a local LLM using Ollama. If you're an educator or student new to large language models, this will walk you through everything step by step.

If you're an advanced user with an existing Ollama setup (custom models, non-default ports, Docker, remote hosting), feel free to skip this guide — Hands-On AI will work as long as your Ollama server is reachable.

What is Ollama?

Ollama is a lightweight tool that allows you to run large language models (LLMs) locally on your computer. It makes it easy to download and run models like Llama 3 without needing specialized hardware or complex setup.

System Requirements

  • Windows, macOS, or Linux computer
  • At least 8GB RAM (16GB+ recommended for better performance)
  • At least 10GB of free disk space
  • An internet connection (for the initial model download)
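
Not sure how much free disk space you have? Here is a quick check using only Python's standard library (the path is the drive where Ollama will store its models; adjust it for your system):

    import shutil

    # Report free space on the drive where Ollama will store models.
    # On Windows, pass a drive root such as "C:\\" instead.
    total, used, free = shutil.disk_usage("/")
    print(f"Free disk space: {free / 1e9:.1f} GB")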

Installation Guide

For macOS

  1. Download the Ollama installer from ollama.ai
  2. Open the downloaded file and drag the Ollama app to your Applications folder
  3. Launch Ollama from your Applications folder
  4. Ollama will run in your menu bar (look for the llama icon)

For Windows

  1. Download the Windows installer from ollama.ai
  2. Run the installer and follow the on-screen instructions
  3. Once installed, Ollama will start automatically and run in the system tray

For Linux

  1. Run the following command in your terminal:
    curl -fsSL https://ollama.ai/install.sh | sh
    
  2. Start the Ollama service (on most Linux systems the install script registers Ollama as a background service, so you only need this if it isn't already running):
    ollama serve
    

Downloading Your First Model

After installing Ollama, you'll need to download at least one model. For beginners, we recommend the llama3 model, which offers a good balance of performance and quality.

  1. Open a terminal or command prompt
  2. Run the following command:
    ollama pull llama3
    
  3. Wait for the download to complete (this may take a few minutes depending on your internet speed)

The first time you run this command, it will download the model files (about 4GB). Once downloaded, the model will be available locally on your computer.
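
To confirm the download, you can list the models available on your machine:

    ollama list

You should see llama3 in the output.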

Testing Your Installation

Let's make sure everything is working properly:

  1. In your terminal or command prompt, run:
    ollama run llama3 "Hello, how are you today?"
    
  2. You should see a response from the model

If you received a response, congratulations! Ollama is set up correctly and ready to use with Hands-On AI.
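
Tip: running the same command without a quoted prompt starts an interactive chat session:

    ollama run llama3

Type /bye when you want to leave the session.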

Using Ollama with Hands-On AI

Hands-On AI is designed to work with Ollama by default. As long as Ollama is running in the background, Hands-On AI will automatically connect to it at http://localhost:11434.

No additional configuration is required unless you've changed Ollama's default settings.
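
If you'd like to verify the connection yourself, here is a minimal sketch (standard library only) that queries Ollama's /api/tags endpoint, which lists the models installed locally:

    import json
    from urllib.request import urlopen

    # Ollama's REST API listens on http://localhost:11434 by default.
    # The /api/tags endpoint returns the locally installed models.
    with urlopen("http://localhost:11434/api/tags") as response:
        models = json.load(response)["models"]

    for model in models:
        print(model["name"])

If llama3 appears in the output, Hands-On AI has everything it needs to connect.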

🧪 Test Hands-On AI from Python

Once you've started a model using:

    ollama run llama3

you can test Hands-On AI in Python with:

    from hands_on_ai.chat import get_response
    print(get_response("What is 3 + 4?"))

You should get a real-time AI response!
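
Building on that, here is a small sketch of an interactive loop. It assumes only what the example above shows: that get_response takes a prompt string and returns the model's reply as a string:

    from hands_on_ai.chat import get_response

    # A tiny chat loop: type a question, read the reply.
    # An empty line ends the session.
    while True:
        prompt = input("You: ").strip()
        if not prompt:
            break
        print("AI:", get_response(prompt))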


⚙️ Advanced: Custom Model Host

If you're running Ollama on a different host or port (e.g. remote or Docker), you can create a config file to tell Hands-On AI where to send requests.

See: docs/configuration.md for advanced setup.

Troubleshooting

"Connection refused" error

If Hands-On AI shows a connection error:

  1. Make sure Ollama is running
  2. Check that you haven't changed the default port (11434)
  3. On Windows or macOS, you might need to restart the Ollama application
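
To narrow down where the failure is, here is a minimal sketch (standard library only) that checks whether anything is answering on Ollama's default port. A healthy server replies with the text "Ollama is running":

    from urllib.request import urlopen
    from urllib.error import URLError

    try:
        with urlopen("http://localhost:11434") as response:
            # A healthy Ollama server answers "Ollama is running".
            print(response.read().decode())
    except URLError as err:
        print(f"Ollama is not reachable: {err}")

If this script fails, the problem lies with the Ollama service itself rather than with Hands-On AI.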

Slow responses

  1. Try a smaller model such as phi, which requires fewer resources
  2. Close other resource-intensive applications
  3. Ensure your computer meets the minimum requirements

"Out of memory" error

  1. Try a smaller model such as phi
  2. Increase your system's virtual memory (swap file)
  3. Close other applications to free up RAM

Available Models

Here are some models you can use with Hands-On AI:

  • llama3 - The recommended default model for most users
  • llama3:8b - The 8-billion-parameter variant (the default llama3 tag points to this same model)
  • codellama - Specialised for programming tasks (recommended for code-related exercises)
  • mistral - An alternative model with good performance
  • phi - A smaller model with good capabilities for simpler tasks

To download any of these models, use:

    ollama pull model_name
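
For example, to fetch the lightweight phi model:

    ollama pull phi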

Need More Help?

For more detailed information about Ollama, visit their official documentation at ollama.ai/docs.



Now that you have Ollama set up, you're ready to use Hands-On AI and start creating your own AI educational experiences!