How was the AI server Terry designed?

Terry was designed with back chat histories, multiple models, and stable diffusion.

host ALL your AI locally

NetworkChuck・3 minutes read

An individual created an AI server named Terry with back chat histories, multiple models, and the ability to add stable diffusion, used for local AI purposes. The server setup was demonstrated on a standard laptop equipped with specific components, including an ASUS X670 E Creator Pro Art motherboard and AMD Ryzen 9 7950X processor, with practical steps involving downloading Alama AI software and setting up models for personalized AI interactions in the notes application Obsidian.

Insights

The AI server named Terry was built with a focus on running AI locally, featuring back chat histories, multiple models, and integration with the notes application Obsidian, showcasing a comprehensive and user-friendly design tailored for personal use.
Practical steps for setting up the local AI server involved downloading Alama AI software, deploying Open Web UI within a Docker container, and customizing models for specific users, highlighting a seamless process that enables diverse interactions and content additions while enhancing productivity and creativity through personalized AI interactions within notes.

Get key ideas from YouTube videos. It’s free

Summary

00:00

"Building Terry: Local AI Server Setup"

The individual built an AI server initially for personal use, with a focus on running AI locally with a graphical user interface and chat interface.
The AI server, named Terry, was designed to have back chat histories, multiple models, and the ability to add stable diffusion, integrated into the notes application Obsidian.
The server setup demonstration was conducted on a standard laptop, indicating that a regular computer could suffice for the task.
Terry, the AI server, was equipped with an ASUS X670 E Creator Pro Art motherboard, an AMD Ryzen 9 7950X processor, 128GB of DDR5 Neo memory, and two MSI Sremm 4090 GPUs.
The server's components included a Leon Lee Zero 11 Dynamic EVO XL case, a Leon Lee water cooler for the CPU, two Samsung 990 Pro 2TB SSDs, and a Corsair AX1600i power supply.
Initial installation attempts with Ubuntu were unsuccessful, leading to the adoption of Pop Os by System76, which worked seamlessly with Nvidia drivers pre-installed.
Practical steps for setting up a local AI server involved downloading the Alama AI software, with specific instructions for Windows users to utilize the Linux version.
The installation process included updating packages, installing Alama with a single command, and testing its functionality through a web browser by accessing localhost and port 11434.
Adding an AI model to Alama was achieved through the Alama Pull command, followed by testing the model with Alama Run to interact with a chat GPT AI without internet connectivity.
The subsequent step involved deploying the Open Web UI for Alama within a Docker container, requiring Docker installation and running a Docker Run command to integrate the UI with the AI server.

10:14

"Customize LAMA CRL Connection and Models"

To check the connection, click on the drop-down menu and select "LAMA two."
Verify the connection by accessing the settings through the icon at the bottom left.
The LAMA-based CRL connection can be changed if needed.
Download additional models from LAMA by typing "LAMA pull code Gemma" in the command line.
Switch between models by selecting the desired one from the drop-down menu.
Explore various features such as editing responses, copying, liking, and disliking responses in the chat.
Mention another model in the chat to initiate a conversation between models.
Upload files by clicking on the plus sign, allowing for various content additions.
Install the multimodal model "LAVA" by pulling it down and changing the model in the browser.
Create and customize models for specific users, like "Deborah," to restrict access to certain prompts or content.

21:02

Instant image generation and AI integration in notes.

By clicking on a specific prompt, an image can be generated instantly, showcasing a fascinating and slightly intimidating process.
Within Open Web UI, a feature named Documents allows for easy addition of documents, facilitating quick access during chats by simply using hashtags.
Obsidian, a preferred notes application, offers the ability to integrate local GPT models like B-M-O Chatbot, enabling personalized AI interactions within notes, enhancing productivity and creativity.