Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Open-source LLMs: Uncensored & secure AI locally with RAG

Name: Open-source LLMs: Uncensored & secure AI locally with RAG
Rating: 4.7 (1884 reviews)

Private ChatGPT Alternatives: Llama3, Mistral a. more with Function Calling, RAG, Vector Databases, LangChain, AI-Agents

Created byArnold Oberleiter

Last updated 11/2025

English

What you'll learn

Why Open-Source LLMs? Differences, Advantages, and Disadvantages of Open-Source and Closed-Source LLMs
What are LLMs like ChatGPT, Llama, Mistral, Phi3, Qwen2-72B-Instruct, Grok, Gemma, etc.
Which LLMs are available and what should I use? Finding "The Best LLMs"
Requirements for Using Open-Source LLMs Locally
Installation and Usage of LM Studio, Anything LLM, Ollama, and Alternative Methods for Operating LLMs
Censored vs. Uncensored LLMs
Finetuning an Open-Source Model with Huggingface or Google Colab
Vision (Image Recognition) with Open-Source LLMs: Llama3, Llava & Phi3 Vision
Hardware Details: GPU Offload, CPU, RAM, and VRAM
All About HuggingChat: An Interface for Using Open-Source LLMs
System Prompts in Prompt Engineering + Function Calling
Prompt Engineering Basics: Semantic Association, Structured & Role Prompts
Groq: Using Open-Source LLMs with a Fast LPU Chip Instead of a GPU
Vector Databases, Embedding Models & Retrieval-Augmented Generation (RAG)
Creating a Local RAG Chatbot with Anything LLM & LM Studio
Linking Ollama & Llama 3, and Using Function Calling with Llama 3 & Anything LLM
Function Calling for Summarizing Data, Storing, and Creating Charts with Python
Using Other Features of Anything LLM and External APIs
Tips for Better RAG Apps with Firecrawl for Website Data, More Efficient RAG with LlamaIndex & LlamaParse for PDFs and CSVs
Definition and Available Tools for AI Agents, Installation and Usage of Flowise Locally with Node (Easier Than Langchain and LangGraph)
Creating an AI Agent that Generates Python Code and Documentation, and Using AI Agents with Function Calling, Internet Access, and Three Experts
Hosting and Usage: Which AI Agent Should You Build and External Hosting, Text-to-Speech (TTS) with Google Colab
Finetuning Open-Source LLMs with Google Colab (Alpaca + Llama-3 8b, Unsloth)
Renting GPUs with Runpod or Massed Compute
Security Aspects: Jailbreaks and Security Risks from Attacks on LLMs with Jailbreaks, Prompt Injections, and Data Poisoning
Data Privacy and Security of Your Data, as well as Policies for Commercial Use and Selling Generated Content

Course content

9 sections • 88 lectures • 10h 2m total length

Welcome0:20
Course Overview6:53
Explore open source LLMs and run them locally, compare censored and uncensored LMS, master prompt engineering, and harness vector databases and retrieval augmented generation for data privacy and security.
Updates0:07
My Goal and Some Tips3:38
Master open-source llms from the ground up, learn to run models locally. Prioritize data privacy and security, while exploring uncensored dolphin models and building an ai community.
Explanation of the Links3:14
Explore links for open-source LLMs and LMS, including Huggingface benchmarks and meta llama models. Learn to access and download PDFs, run Colab notebooks, and set up prompts and AI agents.
Important Links0:42
Instructor Introduction: Arnold Oberleiter (Arnie)1:20
Arnold Oberleitner, aka Arnie, introduces his background in AI, languages, and business, highlighting his work with chatbots, AI agents, and automations for companies, German communities, and small businesses.

What is this Section about?0:35
Learn basics of llms, how they work, and what tokens are; compare advantages and disadvantages of open-source vs closed-source options to pick best llm for local ai with rag.
What are LLMs like ChatGPT, Llama, Mistral, etc.14:59
Understand how LLMs work through an open-source llama two example: 70-billion-parameter model trained on ten terabytes, stored as a compact parameter file with a small run file for local use.
Which LLMs are available and what should I use: Finding "The Best LLMs"7:16
Compare open-source and closed-source LLMs using the LMS arena leaderboard, identify top models by category, and learn how to pick the best open-source options for coding and multilingual tasks.
Disadvantages of Closed-Source LLMs like ChatGPT, Gemini, and Claude10:44
Analyze privacy risks, data training implications, and ongoing costs of closed-source LLMs like ChatGPT, Gemini, and Claude, including limited customization, internet dependence, latency, security concerns, and lack of transparency.
Advantages and Disadvantages of Open-Source LLMs like Llama3, Mistral & more3:30
Explore the advantages and disadvantages of open-source LLMs like Llama3 and Mistral, including data privacy, offline operation, customization, cost savings, and the need for capable hardware.
OpenSoure LLMs get betther! DeepSeek R1 Infos0:35
Recap: Don't Forget This!!3:08
Explore open-source LLMs and how neural nets enable local, private use. Learn about parameter files, GPUs, fine-tuning, and the advantages of offline, customizable models.

Requirements for Using Open-Source LLMs Locally: GPU, CPU & Quantization7:28
Identify hardware requirements for running open-source llms locally, including gpu power with cuda, ram, vram, cpu, storage, and operating systems, and learn quantization to run big models on smaller gpus.
Installing LM Studio and Alternative Methods for Running LLMs8:10
Discover open-source LLM options and learn to run models locally with LM Studio or Alarma, including setup, hardware needs, and privacy features.
Using Open-Source Models in LM Studio: Llama 3, Mistral, Phi-3 & more18:51
Explore how to download, install, and use open-source models in LM Studio on your local PC, including censored and uncensored options such as Llama 3, Mistral, and Phi-3.
4 Censored vs. Uncensored LLMs: Llama3 with Dolphin Finetuning7:54
Open source LLMs can be uncensored and biased; discover dolphin fine tuning on Llama 3 to remove bias, enabling local, private runs.
The Use Cases of classic LLMs like Phi-3 Llama and more5:50
Discover how standard open-source llms run locally to create and edit text, assist with programming, translate, educate, support customers, analyze data, and soon gain multimodal vision features.
Vision (Image Recognition) with Open-Source LLMs: Llama3, Llava & Phi3 Vision8:07
Open-source llms with computer vision enable multimodal on-device models like llama3, pi3, and lava with a vision adapter to analyze and describe images.
Some Examples of Image Recognition (Vision)8:40
Explore open-source vision tools that run locally, describing, interpreting, converting, and extracting images with GPT for vision. See examples of image recognition, evaluation, and privacy-preserving tools on your PC.
More Details on Hardware: GPU Offload, CPU, RAM, and VRAM5:42
Explore GPU offload configurations for large language models in LM Studio, adjusting full, partial, and no GPU offload, and observe effects on CPU, RAM, VRAM, and performance.
Summary of What You Learned & an Outlook to Lokal Servers & Prompt Engineering.5:34
Explore open-source llms that run locally on modest hardware, using LM Studio and dolphin fine tuning to unlock uncensored models and enable prompt engineering for local servers.

New Interface and Models: GPT-oss, Qwen, zAI, Deepseek, Minimax and more1:29
The updated hugging chat interface offers more open source models, including minimax, gpt oss, deep seek, quinn, glm 4.6, and vl models for ocr, with leaderboards to compare options.
HuggingChat: An Interface for Using Open-Source LLMs8:18
Explore Hugging chat as an open-source interface for open-source LLMs, practice prompt engineering, and learn function calling with tools for web search, document parsing, image generation, and more.
System Prompts: An Important Part of Prompt Engineering7:53
Discover how system prompts drive LM performance and form the core of prompt engineering, with presets across hugging chat, LM studio, and customize GPT.
Why is Prompt Engineering Important? [A example]3:14
Explore why prompt engineering is essential for getting the right answers from LLMs, and how word tokens influence reasoning. Apply step-by-step prompts across GPT-3.5, GPT-4, and other LLMs.
Semantic Association: The most Importnant Concept you need to understand4:03
Explore semantic association as the core of prompt engineering, showing how a single word triggers rich context in LLMs like ChatGPT, expanding the search space with related words and examples.
The structured Prompt: Copy my Prompts6:01
Explore the structured prompt framework: modifier, topic, and modifiers, tailoring outputs for audience, SEO keywords, style, and length, with examples like blog posts and Twitter threads.
Instruction Prompting and some Cool Tricks6:34
Learn instruction prompting and three practical tricks to improve outputs, including think step by step and take a deep breath, and how combining prompts yields better responses.
Role Prompting for LLMs6:14
Explore role prompting for LLMs and how assigning a tailored role, such as a professional copywriter, guides semantic associations and context to produce better outputs.
Shot Prompting: Zero-Shot, One-Shot & Few-Shot Prompts7:27
Explore zero-shot, one-shot, and few-shot prompting strategies to tailor open-source models for effective prompt engineering and content generation.
Reverse Prompt Engineering and the "OK" Trick9:43
Master reverse prompt engineering by guiding ChatGPT with a role, telling it to think step by step, and using an 'okay' token-saving reply to craft effective prompts.
Chain of Thought Prompting: Let`s think Step by Step8:14
Learn chain of thought prompting, using either example prompts or letting the model generate its own, and apply the 'let's think step by step' approach to improve reasoning and outputs.
Tree of Thoughts (ToT) Prompting in LLMs10:42
discover how tree of thought prompting builds solutions by generating multiple perspectives, selecting the most logical path, and applying this technique to negotiate salary.
The Combination of Prompting Concepts6:44
Combine prompting concepts to improve open-source LLM outputs by leveraging semantic association, priming, role prompts, structured prompts, and examples for step-by-step quality.
Creating Your Own Assistants in HuggingChat7:51
Create and customize assistants in hugging chat with system prompts and prompt engineering. Explore open-source AI agents that run locally and test with Python code.
Groq: Using Open-Source LLMs with a Fast LPU Chip Instead of a GPU4:11
Grok uses an LPU chip to run open-source LLMs like llama models with far lower latency than GPUs, delivering very fast inference and high tokens per second for real-time apps.
Recap: What You Should Remember2:49
Explore running open-source LLMs in the cloud for free with hugging chat or grok, and master prompts, system prompts, prompt engineering, and semantic association for better results.

What Will Be Covered in This Section?1:44
Explore function calling, vector databases, and embedding models, then install software and set up an LM Studio server to build a private, local rec agent with web search and text-to-speech.
What is Function Calling in LLMs7:04
Explore function calling in LLMs, treating the model as an operating system that delegates tasks to calculators, diffusion models, browsers, and Python tools for local, offline AI.
Vector Databases, Embedding Models & Retrieval-Augmented Generation (RAG)11:58
Explore how embeddings models and vector databases enable retrieval-augmented generation to search uploaded PDFs and documents, even when the context window is limited.
Installing Anything LLM and Setting Up a Local Server for a RAG Pipeline12:06
Install Anything LM and set up a local server to build a private RAG pipeline with open-source models, LM Studio, embeddings, and vector databases.
Local RAG Chatbot with Anything LLM & LM Studio15:28
Build a private, local RAG chatbot with anything LLM and LM Studio, embedding uploaded documents into a local vector database for secure, private AI.
Function Calling with Llama 3 & Anything LLM (Searching the Internet)7:53
Learn to enable function calling in anything LM, integrate web search for a local chatbot, and test internet-driven results using serpent def API and LM studio.
Function Calling, Summarizing Data, Storing & Creating Charts with Python7:58
Explore function calling and summarization with open-source llms, learning to view and summarize documents, store data in long-term memory, and generate charts with a python library.
Other Features of Anything LLM: TTS and External APIs12:10
Explore how Anything LLM enables local AI with text-to-speech and external APIs, configure agent skills and LM preferences, and work with private vector databases like Lance DB and Pinecone.
Downloading Ollama & Llama 3, Creating & Linking a Local Server13:34
download Ollama, install a local llms, start a local server, and connect Llama 3 to LM Studio for local AI workflows with quantized models.
Recap Don't Forget This!3:53
Discover how open-source LLMs function as modular systems, using function calling to connect web search, calculators, embeddings, and vector databases for PDFs and transcripts.

What Will Be Covered in This Section: Better RAG, Data & Chunking1:15
Discover how data quality drives rec app performance, handle messy PDFs and websites, and prepare markdown text with Crawl and Llama Bars, focusing on chunk size and overlap for RAG.
Tips for Better RAG Apps: Firecrawl for Your Data from Websites4:27
Discover how to build better rag apps with fire crawl, converting website content into markdown or JSON for training your retrieval-augmented generation pipeline using LangChain, APIs, and GitHub resources.
More Efficient RAG with LlamaIndex & LlamaParse: Data Preparation for PDFs &more12:32
Explore open-source llama bars and llama index to convert PDFs and other data into markdown for LLMs, enabling efficient data preparation for RAG with Llama parse.
LlamaIndex Update: LlamaParse made easy!2:18
Parse PDFs in the llama cloud with llama parse, outputting markdown, text, or JSON so documents become LLM-ready. This update bypasses llama index for faster, drag-and-drop processing.
Chunk Size and Chunk Overlap for a Better RAG Application8:57
Learn how chunk size and chunk overlap optimize rag applications by enabling precise embeddings in a vector database, with practical rules, defaults, and tuning across text types.
Recap: What You Learned in This Section2:35
Enhance your RAG app with messy data, transform sources to markdown with Fire Crawl, use Llama Index for PDFs and CSVs, and tune chunk size and overlap (1–5%).

What Will Be Covered in This Section on AI Agents1:56
Define what AI agents are and how linking a few llms creates one. Explore frameworks like lang chain with lang graph and lang flow, and run locally with nodejs.
AI Agents: Definition & Available Tools for Creating Opensource AI-Agents14:10
Define ai agents as supervisor llms using sub experts to complete tasks, and explore open-source tools like bot press, LangChain, Lang Flow, and Vector Shift for building autonomously capable chatbots.
We use Langchain with Flowise, Locally with Node.js5:42
Build with Langchain and Flowise while running locally with Node.js. Explore local installation and cloud deployment options, including render and AWS.
Installing Flowise with Node.js (JavaScript Runtime Environment)7:07
Install flow wise locally with Node.js by running npm install -g flow wise. Start the local server with npx flow wise start to access the interface on localhost:3000.
Problems with Flowise installation3:49
Fix flow wise installation by downgrading Node.js to a supported 18–20 version with NVM for Windows, then verify and switch versions as administrator using nvm list and nvm use.
How to Fix Problems on the Installation with Node3:49
Resolve flow wise installation issues by using a compatible node version, downgrading to 20.6.0 with nvm for windows, and switching versions with nvm list and nvm use.
The Flowise Interface for AI-Agents and RAG ChatBots5:25
Explore the Flowise interface for building AI agents and RAG chatbots, learn to use dark mode, navigate flows and marketplaces, and set up local Q&A with templates and documentation.
Local RAG Chatbot with Flowise, LLama3 & Ollama: A Local Langchain App16:39
Build a local, open-source rec bot using Flowise, Ollama Llama3, and LangChain, connecting a local llama server to a conversational retrieval QA chain with an in-memory vector store.
Our First AI Agent: Python Code & Documentation with Superwicer and 2 Worker15:07
Create your first ai agent with a supervisor and two workers—a coding agent and a documentation agent—using flow wise, function calling, and llama models, then generate code and docs.
AI Agents with Function Calling, Internet and Three Experts for Social Media16:33
Build an AI agent system with function calling, a web searcher, a creative writer, and three social media experts to produce a blog post and seven tweets.
Which AI Agent Should You Build & External Hosting with Render12:31
Explore building versatile agents for automation and hosting chatbots for clients via render. Learn when to use local or cloud inference, including open-source tools and OpenAI API.
Chatbot with Open-Source Models from Huggingface & Embeddings in HTML (Mixtral)8:37
Build a chatbot with open-source models from Hugging Face using local inference, embeddings, and prompts, configuring credentials. Embed the chat widget in html or a website and share it publicly.
Insanely fast inference with the Groq API8:27
Boost local ai speed with Grok API for insanely fast inference, reaching around 1000 tokens per second. Explore integrating Grok with flow wise using Llama models and credentials.
How to use DeepSeek R1: Locally, in Browser and the API
Recap What You Should Remember4:29
Discover how linking llms creates ai agents with lang chain and related frameworks, and learn to build, run, and host secure, open-source agents locally for tasks from coding to content creation.

What Is This Section About?2:06
Explore open-source text-to-speech, AI conversations, and fine-tuning workflows in Google Colab, including auto train from Huggingface and Alpaca fine-tuning, plus GPU rental options.
Text-to-Speech (TTS) with Google Colab8:38
Explore open source text-to-speech tools you can run locally or in Google Colab, comparing OpenAI options and using TTS models like rts1 hd and Onnx to generate speech audio.
Moshi Talk to an Open-Source AI4:20
Explore Moshi, a free, open-source AI you can talk to like ChatGPT; join with your email, use it in Google Colab, and generate code with community resources.
Finetuning an Open-Source Model with Huggingface or Google Colab5:59
Fine-tune open-source models, including uncensored ones, with Hugging Face Auto Train or Google Colab notebooks. Manage costs and time by using GPUs and preparing a robust data set.
Finetuning Open-Source LLMs with Google Colab, Alpaca + Llama-3 8b from Unsloth22:51
Show how to fine-tune open-source LLMs with Google Colab, using Alpaca and Llama-3 8b from Unsloth, with a 51,000-row instruction-input-output dataset, and note risks of hallucinations.
What is the Best Open-Source LLM I Should Use?4:03
Evaluate open-source LLMs by comparing leaderboard standings and capabilities, from llama models to gemma and deep decoder, and note grok from xai as a major option.
Llama 3.1 Infos and What Models should you use3:42
Grok from xAI4:09
Grok from xAI presents grok 1.5 with a 128,000 token limit and strong multi-modal vision. Open-source weights enable local runs, though this model requires powerful hardware or an x subscription.
Renting a GPU with Runpod or Massed Compute if Your Local PC Isn't Enough7:28
Rent GPUs with RunPod or Mass Compute when your PC lacks power. Use bloke templates and a one click UI to deploy models from HuggingFace and run on demand GPUs.
Recap: What You Should Remember!6:06
Compare open-source text-to-speech and Colab workflows with OpenAI API outputs for local deployment. Evaluate fine-tuning trade-offs, data quality, and tools like grog and Runbot.

THE LAST SECTION: What is This About?1:50
Explore security and data protection for open and closed source llms, including jailbreaking, prompt injections, data poisoning, locally safe outputs, and whether you can use outputs commercially.
Jailbreaks: Security Risks from Attacks on LLMs with Prompts12:02
Explore how jailbreak prompts threaten llms security, revealing many-shot techniques, base64 encoding tricks, and prompt-based bypasses across open and closed models, with safety implications.
Prompt Injections: Security Problem of LLMs8:31
Explore how prompt injections threaten open-source LLMs and enable harmful prompts to override instructions. Identify tactics like white-text prompts, phishing links, and data exfiltration risks, and learn defensive precautions.
Data Poisoning and Backdoor Attacks3:45
Explore data poisoning and backdoor attacks in open-source LLMs, including fine-tuned models from Hugging Face, and learn the data security and privacy implications.
Data Privacy and Security: Is Your Data at Risk?8:13
Prioritize local operation for privacy when using open source LLMs. LM Studio, llama, grok, hugging chat, and the OpenAI API illustrate offline use, data privacy, and cloud risks.
Commercial Use and Selling of AI-Generated Content8:40
Navigate licensing for open-source LLMs and AI-generated content, including usage rights, attribution, and commercial considerations. Learn how to verify licenses for Llama, LM Studio, Stable Diffusion, and OpenAI's copyright shield.
My Thanks and What's Next?4:41
Explore the benefits and drawbacks of open-source lms and learn to run private, uncensored ai locally using rag, prompts, and agents.
Bonus0:37

Requirements

No prior knowledge is required; everything will be shown step by step.
It is advantageous to have a PC with a good graphics card, 16 GB RAM, and 6 GB VRAM (the Apple M series, Nvidia, and AMD are ideal), but this is not mandatory.

Description

ChatGPT is useful, but have you noticed that there are many censored topics, you are pushed in certain political directions, some harmless questions go unanswered, and our data might not be secure with OpenAI? This is where open-source LLMs like Llama3, Mistral, Grok, Falkon, Phi3, and Command R+ can help!

Are you ready to master the nuances of open-source LLMs and harness their full potential for various applications, from data analysis to creating chatbots and AI agents? Then this course is for you!

Introduction to Open-Source LLMs

This course provides a comprehensive introduction to the world of open-source LLMs. You'll learn about the differences between open-source and closed-source models and discover why open-source LLMs are an attractive alternative. Topics such as ChatGPT, Llama, and Mistral will be covered in detail. Additionally, you’ll learn about the available LLMs and how to choose the best models for your needs. The course places special emphasis on the disadvantages of closed-source LLMs and the pros and cons of open-source LLMs like Llama3 and Mistral.

Practical Application of Open-Source LLMs

The course guides you through the simplest way to run open-source LLMs locally and what you need for this setup. You will learn about the prerequisites, the installation of LM Studio, and alternative methods for operating LLMs. Furthermore, you will learn how to use open-source models in LM Studio, understand the difference between censored and uncensored LLMs, and explore various use cases. The course also covers finetuning an open-source model with Huggingface or Google Colab and using vision models for image recognition.

Prompt Engineering and Cloud Deployment

An important part of the course is prompt engineering for open-source LLMs. You will learn how to use HuggingChat as an interface, utilize system prompts in prompt engineering, and apply both basic and advanced prompt engineering techniques. The course also provides insights into creating your own assistants in HuggingChat and using open-source LLMs with fast LPU chips instead of GPUs.

Function Calling, RAG, and Vector Databases

Learn what function calling is in LLMs and how to implement vector databases, embedding models, and retrieval-augmented generation (RAG). The course shows you how to install Anything LLM, set up a local server, and create a RAG chatbot with Anything LLM and LM Studio. You will also learn to perform function calling with Llama 3 and Anything LLM, summarize data, store it, and visualize it with Python.

Optimization and AI Agents

For optimizing your RAG apps, you will receive tips on data preparation and efficient use of tools like LlamaIndex and LlamaParse. Additionally, you will be introduced to the world of AI agents. You will learn what AI agents are, what tools are available, and how to install and use Flowise locally with Node.js. The course also offers practical insights into creating an AI agent that generates Python code and documentation, as well as using function calling and internet access.

Additional Applications and Tips

Finally, the course introduces text-to-speech (TTS) with Google Colab and finetuning open-source LLMs with Google Colab. You will learn how to rent GPUs from providers like Runpod or Massed Compute if your local PC isn’t sufficient. Additionally, you will explore innovative tools like Microsoft Autogen and CrewAI and how to use LangChain for developing AI agents.

Harness the transformative power of open-source LLM technology to develop innovative solutions and expand your understanding of their diverse applications. Sign up today and start your journey to becoming an expert in the world of large language models!

Who this course is for:

To everyone who wants to learn something new and dive deep into open-source LLMs with RAG, Function Calling and AI-Agents
To entrepreneurs who want to become more efficient and save money
To developers, programmers, and tech enthusiasts
To anyone who doesn't want the restrictions of big tech companies and wants to use uncensored AI

Open-source LLMs: Uncensored & secure AI locally with RAG

What you'll learn

Explore related topics

Course content

Introduction and Overview7 lectures • 16min

Why Open-Source LLMs? Differences, Advantages, and Disadvantages7 lectures • 41min

The Easiest Way to Run Open-Source LLMs Locally & What You Need9 lectures • 1hr 16min

Prompt Engineering for Open-Source LLMs and Their Use in the Cloud16 lectures • 1hr 41min

Function Calling, RAG, and Vector Databases with Open-Source LLMs10 lectures • 1hr 34min

Optimizing RAG Apps: Tips for Data Preparation6 lectures • 32min

Local AI Agents with Open-Source LLMs15 lectures • 2hr 4min

Finetuning, Renting GPUs, Open-Source TTS, Finding the BEST LLM & More Tips10 lectures • 1hr 9min

Data Privacy, Security, and What Comes Next?8 lectures • 48min

Requirements

Description

Who this course is for: