Tag: attention is all you need

  • Going Deeper into AI: The Guts of ChatCPT and Large Language Models

    Going Deeper into AI: The Guts of ChatCPT and Large Language Models

    Nerding Out with That Nerdy Catholic
    Nerding Out with That Nerdy Catholic
    Going Deeper into AI: The Guts of ChatCPT and Large Language Models
    Loading
    /

    In this episode, we take some time to look under the hood of how a Large Language Model (LLM) like ChatCPT works. Building off of the previous episode where we looked at neural networks, the basis for all AI, we explore how these neural networks are used to comb through large quantities of text, find its meaning, and respond with language that is easy to understand.

    We move beyond simple neural networks to discuss the intricacies of Transformer Architecture and Retrieval-Augmented Generation (RAG) and how this approach is central to understanding modern artificial intelligence and machine learning applications.

    We also touch on the benefits of running your own local AI model and agent and how we use Ollama at That Nerdy Catholic to help with some of our tasks. You want to know how LLM actually works? Continue this exploration of AI with us.

    Links from Episode:

    Attention is all you need: https://arxiv.org/pdf/1706.03762

    We want your input for this series, so head over to https://thatnerdycatholic.com/ai to share your thoughts and experiences.

    Get your Nerding Out merch at https://thatnerdycatholic.com/merch
    Support That Nerdy Catholic at https://thatnerdycatholic.com/support
    Facebook: https://www.facebook.com/thatnerdycatholic/
    Instagram: https://www.instagram.com/ThatNerdyCatholic

  • The Technology Behind ChatGPT: Neural Networks and the AI Revolution

    The Technology Behind ChatGPT: Neural Networks and the AI Revolution

    Nerding Out with That Nerdy Catholic
    Nerding Out with That Nerdy Catholic
    The Technology Behind ChatGPT: Neural Networks and the AI Revolution
    Loading
    /

    What is AI, and how does it actually work?

    Artificial Intelligence has exploded in popularity with tools like ChatGPT, but the technology behind it has been developing for decades. In this episode, Seth and Cesar break down Artificial Intelligence for beginners, explaining where AI came from, how it evolved, and why it suddenly became so powerful in recent years.

     

    Starting with the early ideas of AI in the 1950s, they trace the key breakthroughs that led to today’s revolution in machine learning and large language models (LLMs). One turning point was the groundbreaking 2017 research paper “Attention Is All You Need,” developed by Google researchers, which introduced the transformer architecture that powers many modern AI systems.

    In Part 1 of this two-episode series, they focus on the core building block of modern AI: neural networks.

    • What is a neural network?
    • How do artificial neural networks compare to the neurons in the human brain?
    • How do machines actually learn from data?
    • What’s the difference between supervised learning and unsupervised learning?

    If you’ve ever wondered how ChatGPT works, how machines learn patterns, or why AI suddenly seems everywhere, this episode will give you a clear foundation without requiring advanced math.

    For viewers who want to dive deeper into the mathematics behind neural networks, Seth and Cesar also reference the excellent visual explanations from the YouTube series by Grant Sanderson on the channel 3Blue1Brown.

    If you’re curious about Artificial Intelligence, neural networks, machine learning, and how modern AI models are trained, this video is the perfect place to start.