Get the latest AI workflows to boost your productivity and business performance, delivered weekly by expert consultants. Enjoy step-by-step guides, weekly Q&A sessions, and full access to our AI workflow archive.
Summary
In this engaging video, Aaron Jack explores the exciting world of AI agents. He delves into the mechanisms that make AI agents powerful tools for tech professionals. This deep dive extends into how to effectively leverage AI to ensure you remain competitive in the rapidly evolving tech landscape. Aaron highlights the potential of AI agents in transforming workflows and cutting costs while enhancing task performance. The video also provides practical insights into running AI models locally, discusses the growth potential of niche AI businesses, and the importance of continuous learning with AI-focused courses.
Highlights
Aaron switched to Windows for better AI model management, showcasing a radical tech shift. 💻
Simply Learn offers extensive AI and machine learning courses that are highly rated. 🎓
Model size considerations are crucial when running AI locally on limited hardware. 🎛️
Real-world applications of AI in business lead generation are explored. 🌍
Creating a local AI setup involves understanding technical constraints and execution flows. 🛠️
Key Takeaways
AI agents are the future - understanding them is crucial to stay competitive in the tech industry. 🤖
AI agents function as composable building blocks, optimizing workflows and decision-making. 🧩
Running AI models locally can save costs and enhance performance, provided hardware constraints are managed. 💾
AI learning resources, like Simply Learn's courses, play a vital role in career development. 📚
Customization and orchestration of AI workflows can lead to lucrative business opportunities. 💡
Overview
In this captivating tutorial, Aaron Jack, a tech enthusiast and educator, unveils how AI agents are reshaping the tech landscape. With dramatic flair, he underscores the importance of mastering AI to keep pace with technological advancements. AI agents, according to Aaron, act as orchestrators, elevating workflows and making split-second decisions, akin to maestros conducting an orchestra.
Throughout the session, Aaron emphasizes the significance of understanding the architecture of AI, particularly when running these complex models locally. He shares valuable insights into managing model sizes and making the most of limited GPU resources, taking viewers through the step-by-step process of optimizing local AI performance. Such in-depth knowledge is what separates novices from experts, he contends, encouraging continuous learning.
Moreover, Aaron introduces viewers to promising educational resources, notably the Simply Learn AI and Machine Learning courses, which offer hands-on projects and certification, serving as a gateway to mastering AI. His practical guide combines technical know-how with strategic thinking, motivating tech professionals to not just learn about AI but to actively incorporate it within their workflows for business success.
Chapters
00:00 - 01:30: Introduction to AI Agents The chapter 'Introduction to AI Agents' highlights the critical importance of understanding AI technology, especially for those in the tech industry or with an interest in programming. It underscores a bold perspective that failing to grasp the concept of AI agents could leave individuals lagging behind in the future of technology. The discussion points out that AI agents are considered to be a groundbreaking innovation, potentially surpassing the impact of Software as a Service (SAS). This realization has prompted some, including the narrator, to adapt significantly, such as switching to Windows and investing in new hardware. The mention of familiar AI entities like Chatbot GPT and OpenAI is accompanied by an introduction to AI agents, as elaborated in an article by Anthropic.
01:30 - 03:00: Composable Building Blocks Agents can be thought of as composable building blocks, similar to programming patterns. These can be used to build workflows that either augment or replace functions by executing actions. At a higher level, agents orchestrate workflows and functions by deciding what actions to take and determining subsequent actions based on results. These concepts are akin to known programming constructs.
03:00 - 04:30: Trade-offs in Agent Systems The chapter titled 'Trade-offs in Agent Systems' covers concepts related to programming, especially in the context of agent systems. It discusses prompt chaining, which is likened to making multiple function calls with optional error handling, and routing, which is compared to parallel concurrent async programming. These techniques are used to improve performance by enabling multiple tasks to run simultaneously. Furthermore, the terms orchestration and synthesis are explained as processes similar to those carried out by data engineers, who transform large datasets into more structured formats. The chapter suggests that while these workflows and systems are beneficial, they come with trade-offs, although the specific trade-offs are not detailed in the transcript.
04:30 - 06:00: Learning AI: Importance and Resources This chapter discusses the importance of understanding the latency and cost involved in running AI agents. It highlights how AI agents are both time-consuming and expensive due to the various back-tasks, multiple recursive LLM (Language Model) calls, and the necessity of large context feeds to understand past actions. However, an intriguing solution to these issues is presented: the option to run large language models (LLMs) directly on a personal computer. This approach is both cost-effective and faster, providing an exciting alternative to the traditional methods. The chapter emphasizes the significance of considering these factors in AI learning and offers resources to overcome the challenges.
06:00 - 09:00: Running LLMs Locally The chapter emphasizes the importance of learning AI to secure one's future career, suggesting that learning AI is crucial to thriving in the job market. It references Simply Learn's AI and machine learning courses as serious resources for learning. It also mentions that the video is sponsored by Simply Learn, and highlights Microsoft-backed AI courses as particularly noteworthy.
09:00 - 10:30: Origami Agents and Business Applications This chapter discusses an engineering course that spans various topics, such as generative AI, deep learning, and prompt engineering. It offers over 25 projects and a Capstone to provide hands-on experience. The program includes electives for majoring in advanced generative AI and NLP, and it prepares participants for the Microsoft Azure certification exam. Upon completion, participants receive a certificate from Microsoft. The course has positive reviews, with ratings of 4.5 on Switch Up and 4.4 on Career Karma.
10:30 - 14:30: Building a Simple AI Agent Application The chapter discusses options for financing AI education and encourages checking out Simply Learn's website for structured learning courses. The chapter is sponsored by Simply Learn. It provides instructions on visiting 'ama.com' to download AI models for free, explaining the steps to navigate to the models tab and view the available list.
Learn AI Agents - How they Work & Build Your Own Transcription
00:00 - 00:30 this is going to sound dramatic but if you're in Tech if you're a programmer building apps or just interested I really think if you don't understand this then you're going to get completely left behind in the coming years why combinator is saying this is going to be 10 times bigger than SAS it actually led to me switching to Windows for the first time ever and buying a new laptop when you think of AI apps you probably think of okay chbt open AI but once or twice you might have heard of AI agents and this article by anthropic lays it out in
00:30 - 01:00 a really solid way you can think of Agents as composable building blocks similar to patterns in program and with these building blocks you can either build workflows which augment your code and replace functions and can do a set of actions or you have agents which are a level higher they orchestrate your workflows your functions so basically they choose what actions to take and then based on the result of that action they choose what to do next now these building blocks I spoke about they're very similar IL to Concepts you already
01:00 - 01:30 know if you know anything about programming you have prompt chaining which is the same as doing multiple function calls with optional error handling evaluator Optimizer it's just a loop routing is like parallel concurrent async programming which helps improve your performance by running multiple things at once and orchestration and synthesis basically what data Engineers do you take large data sets and you transform it into a more useful structured format as we can see there's a big catch with these types of workfl flows and systems they often trade
01:30 - 02:00 latency and cost for better task performance so agents are expensive in time and money they take a while to run because you're doing all these backtack tasks and you're doing tons of llm calls maybe recursively in a loop and feeding in large context because your agent needs to understand the past actions that it's already taken but and this is super interesting is there is a way completely around this and it's why I bought the new laptop it allows you to run llms on your computer for free and a lot faster everyone is saying it it's
02:00 - 02:30 not AI that's going to take your job it's someone that knows AI better than you do if that's true at all what we're doing here learning is absolutely key for securing your future career in other words you want to thrive you got to learn this stuff as much as possible now the most serious courses that I've come across when I was searching around trying to learn are simply learns Ai and machine learning courses give me just a minute CU if you want to go deep I think they're worth checking out and just a heads up this video is sponsored by simply learn one of the best ones I came across was the micros soft bagged AI
02:30 - 03:00 engineer course because it covers everything from generative AI to deep learning prompt engineering and more there's over 25 projects and a Capstone so you'll walk away with a lot of hands-on experience then there's electives which really let you specialize advanc generative AI NLP and even preparing for the Microsoft azer certification exam and in the end you even get a certificate from Microsoft and if you're curious about reviews 4.5 on switch up 4.4 on Career Karma you can check those out they've also got
03:00 - 03:30 financing options so I would encourage you if this sounds interesting at all at least check out Simply learns website evaluate some of the different courses and this is a really structured way just to get fully immersed in AI so if you're interested check out the pin comment or Link in description thanks again to Simply learn for the sponsor back to the video all you need to do is go to ama.com and you can download a bunch of different ones for free so running through this really quick you just go to the models Tab and you can see a full list here it goes goes on and on now
03:30 - 04:00 most important part when you're running it locally your model size has to be less than your GPU vram so this particular card RTX 4070 it has 8 GB so I have to check how big is the model not in terms of uh parameters like this one has 70 billion but in terms of the actual let's say file or uh trained model size so I can go into for example llama 3.3 it's going to be too big I
04:00 - 04:30 already know with the 70 and the 405 um billion parameters so what I do is I just go into tags and I can see like yeah they're 40 49 53 and so on I already have a few installed and again you can install a new one just by running this command and it will immediately start running it but I can uh show you which ones I have installed so o llama LS and you'll see that I have quen I have llama 3
04:30 - 05:00 actually have a compressed version right now and uh just cuz I was testing it and I have L lava which is image analysis uh so these quen models you'll notice they're actually even the compressed ones are under eight but actually when I run it it is not fully using my GPU so if I do if I run quen instruct Q2 let me just show you what I mean so once you run that you get a command prompt and you can kind of test the speed by typing in a command hello
05:00 - 05:30 and even for that really simple one you'll see there was a bit of a delay now if I go to a new window and I do AMA PS I can see the reason is because actually when this model is running the size expands to 7.5 GB because it needs a little bit of extra space and then on top of that my GPU needs some extra space to run normal processes so while it's able to fit 91% into the GPU that's still going to be a pretty big performance hit because it has to offload things to the CPO which is just
05:30 - 06:00 like exponentially slower and it's what you'd have to do if you don't have a GPU locally so let's just kill this one and we'll instead run the Llama 3.2 so I just copy that model name just running it again because it didn't fully stop the previous one and now when we type hello boom instant response that's what we want now if we check the you uh utilization we can see it's 100% GPU it expanded to 5.4 but that's okay as long as you you have this
06:00 - 06:30 when you run a llama piece companies like this origami agents it's in y combinator in the first month I think they're doing 100K recurring Revenue already and let's just take a look at this company and try to build a simple version so they do business lead generation with custom prompting and their whole pitch is something I've described in the past few videos you have virtually unlimited unstructured data on the web that maybe doesn't exist in a database you have access to and if you can extract and structure this in a useful way this is a huge opportunity
06:30 - 07:00 even if you're building a very Niche agent for a very specific industry so if we scroll down we can see some of the queries people are able to run find woocommerce store owners who sell products covered in uh By Us health insurance here's another one you can visit the site if you want to see them all but let's take a look at this one specifically so if we think in terms of agentic patterns so for this one if we think in terms of agentic patterns how would we actually achieve this with various workflows to make this a bit more concrete we have the orchestrator at the top which first chooses what
07:00 - 07:30 steps we want to run so first find products second find stores and then third see if they're woocommerce or not and within each step we'd have sub workflows so we might have just have a search Google workflow a scrape article site to extract the information we need so I've actually coded a simple version of an app that's very similar to what a B2B agent like origami would do but I've given you the choice here between running it locally for free or using the open AI API but again I'll emphasize
07:30 - 08:00 that agents specifically they can do a lot of llm calls consume a lot of tokens so you can see my component files map to the architecture that we talked about first I have the orchestrator which has a lot of different methods and again if you really want to get into it just check out the code but at the highest level first we're generating the tasks upfront first do this then do this then do this and use subw workflows to accomplish them then another important method we have a prompt to select the next workflow from the workflow definitions in these folders which are basically the same as just programming
08:00 - 08:30 functions for our workflows I could have gone super generic and just written a crawler with a custom prompt but I wanted to make it a little bit more specific where I can where I have one just for search Google where I can ensure that the search results are getting crawled and extracted correctly so I've also added a little bit of custom scraping code here if we're doing B2B having something that finds LinkedIn profiles is very useful and I've specified here that we want to just pull the serps or Google search results for let's say name company name let's say we
08:30 - 09:00 do a prompt like find me all the coding boot camp owners in the USA and send me their name and their LinkedIn profile so I can message them that would be an example use case there and then of course we have our generic crawl site so we have these search results for Google then our orchestrator will decide which ones are worth visiting that might have the information we want so I've run this a few times already and I'll show you the output I put in find 10 Facebook software engineer names and their LinkedIn profiles and we can see just from this prompt the model completely
09:00 - 09:30 ran and extracted exactly what we need we have name profile URL and position so software engineer software engineer yeah pretty much all software Engineers with direct links let me just show you another example of this let's say you want to find Shopify apps to Market a specific product to them all I have to do is say find me 10 Shopify apps then find their Founders SL owners and their
09:30 - 10:00 their names and Linkedin URLs now I don't want this video to get too long but let's just see what it does first so so here our orchestrator has broken it up into two key tasks first find the Shopify apps and names from a Google search then get founder names and Linkedin URLs so it's going to complete the first one before the second one then it selected the search Google workflow and we can see the search query here that'll run for a little bit and then we'll see that our step was complete with a summary so I found the first Shopify app which is named clavio and it found and it's doing another search
10:00 - 10:30 query for clavio Founders LinkedIn and so far it found two results now it's doing the same for oero privy and it's just going down through the list of URLs that we found so this is going to continue to run let me just show you the end output with that very simple prompt that we put in in the beginning so here we can see a result summary and it also saved us a Json file let's open that file and first we can see it got all the app names and URLs but and we can also look at the summary down here unfortunately it didn't 100% understand
10:30 - 11:00 this because for certain apps it found more than one person so of course you could further refine The Prompt that you're inputting to get a different result but let's just take a look at one of these URLs to see if it's correct and yeah for this one at least we got the co-founder at sprocket let's just check another one toer tagrin for yo and yeah so we got the CEO there so so of course it's still not perfect I coded this agent in about 1 day but you can probably start to see the power of composing things together doing custom workflows and having a really solid
11:00 - 11:30 orchestrator that being said if you like this video please leave a comment so I can make more agent videos or AI videos and with that being said I'll see you guys in the next one