DIY AI: Running Models on a Gaming Laptop for Beginners!

When DeepSeek AI burst onto the scene a week or two ago, it shook up the industry by proving that large language models can be made more efficient – in fact it’s possible to get the full DeepSeek model running on hardware that a mere mortal could acquire with a few thousand bucks. This shift raises an interesting question—can useful AI models run locally on consumer-grade computers now without relying on cloud-based data centers?

In my latest video, we take a look at running some “distilled” open source versions of DeepSeek and Meta’s Llama large language models. I’m surprised how far the quality of locally hosted models has come in such a short period of time.

To find out, I tested a distilled version of the DeepSeek model on a Lenovo Legion 5 laptop, which is equipped with an Nvidia RTX 3070 GPU with 8GB of VRAM. The goal was to see if local AI could generate useful results at a reasonable speed.

The setup process was straightforward. After downloading and installing Nvidia’s CUDA toolkit to enable GPU acceleration, I installed Ollama, a command line tool for running many of the available models. From there, it was just a matter of selecting and downloading an appropriate AI model. Since the full DeepSeek model requires an impractical 404GB of memory, I opted for the distilled 8B version, which uses 4.9GB of video memory.
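Beyond its interactive command line, Ollama also exposes a local REST API that scripts and other tools can call. Here’s a minimal sketch of querying it from Python, assuming Ollama is running on its default port 11434 and that the distilled model was pulled under a tag like `deepseek-r1:8b` (the exact tag is an assumption on my part):

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_request(model: str, prompt: str) -> bytes:
    """Build the JSON body Ollama's /api/generate endpoint expects."""
    return json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")

def ask(model: str, prompt: str) -> str:
    """Send a prompt to the local Ollama server and return the response text."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=build_request(model, prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Example (requires a running Ollama instance with the model already pulled):
# print(ask("deepseek-r1:8b", "Explain model distillation in two sentences."))
```

With `"stream"` set to `False` the server returns one complete JSON object rather than a stream of partial tokens, which keeps the sketch simple.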

With everything in place, I launched the model and checked that it was using the GPU correctly. The first test was a basic interaction in the command line. The DeepSeek model responded quickly and even displayed its thought process before generating a reply, which is a unique feature compared to traditional locally hosted chatbots. Performance-wise, it was surprisingly snappy for a locally run AI.

To gauge the model’s practical utility, I compared it to Meta’s open-source Llama model, selecting a similarly sized 8B variant. Performance between the two was comparable in terms of speed, but the responses varied. While DeepSeek’s output was structured and fairly coherent, Llama’s responses felt more refined in certain cases.

To take things further, I integrated Open WebUI, which provides a ChatGPT-style interface for easier interaction. This required installing Docker, but once set up, it significantly improved usability.
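As a rough sketch of that setup: Open WebUI runs in a Docker container, and once it’s up you can check that it’s reachable before pointing a browser at it. The docker command and port below are assumptions based on the project’s typical defaults, not an exact recipe:

```python
import urllib.request

# Open WebUI is typically launched with Docker, along these lines (flags and
# image tag may differ by version -- check the project's own docs):
#   docker run -d -p 3000:8080 -v open-webui:/app/backend/data \
#       --name open-webui ghcr.io/open-webui/open-webui:main

def webui_is_up(url: str = "http://localhost:3000", timeout: float = 2.0) -> bool:
    """Return True if something answers at the Open WebUI address."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except Exception:
        return False
```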

Next, I tested both models with a programming task—creating a simple Space Invaders game in a single HTML file. DeepSeek struggled, generating a mix of JavaScript and Python code that didn’t function correctly. Even when prompted differently, the results were inconsistent. The larger 14B version of DeepSeek running on my more powerful gaming PC did slightly better but still failed to produce a playable game. The Llama model performed marginally better, generating a somewhat functional version, but it was still far from the quality produced by cloud-based AI models like ChatGPT, which created a polished and working game on the first attempt.

For a different type of challenge, I had the models generate a blog post based on a video transcript. Initially, DeepSeek only provided an outline instead of a full narrative. After refining the prompt, it did produce something usable, though still less polished than ChatGPT’s output. Llama performed slightly better in this task, generating a clearer and more structured narrative after a nudge to get it out of its outlining mindset.

While local AI models aren’t yet on par with their cloud-based counterparts, the rapid improvements in efficiency suggest that practical, high-quality AI could soon run on everyday devices. Now that DeepSeek is pushing the industry to focus on optimization, it’s likely that smaller, more specialized models will become increasingly viable for local use.

For now, running AI on consumer hardware remains a work in progress. But it’s come a long way from where it was just a year ago, so it’ll be exciting to see what happens next.

Plaud AI NotePin Review

I recently got my hands on the NotePin by Plaud AI, a compact and wearable voice recorder with a robust set of AI tools attached through its accompanying mobile app. Plaud’s value-add is that they’ve simplified the process of generating transcriptions (complete with speaker detection) along with AI generated summaries.

You can see it in action in my latest review.

The NotePin is priced at $169 (compensated affiliate link) with an additional $79 per year subscription for the “Pro Plan” that includes additional monthly transcription minutes and additional summarization templates.

The free plan, however, is still quite functional, offering 300 minutes (5 hours) of transcription per month along with the summaries of those transcriptions. The Pro Plan comes with 1,200 monthly minutes (or 20 hours) of transcription time.
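To put those quotas in concrete terms, here’s a trivial sketch of the free-tier math (the numbers are just the plan limits mentioned above):

```python
FREE_MINUTES = 300   # 5 hours of transcription per month on the free plan
PRO_MINUTES = 1200   # 20 hours per month on the $79/year Pro Plan

def fits_free_tier(recording_minutes: list) -> bool:
    """Check whether a month's worth of recordings fits the free quota."""
    return sum(recording_minutes) <= FREE_MINUTES

# Four weekly one-hour meetings fit comfortably in the free plan:
# fits_free_tier([60, 60, 60, 60])  -> True
```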

All of its AI magic happens in the cloud. The NotePin itself is just an audio recorder with 64 GB of storage and enough battery life to run for well over 10 hours between charges. It’s small, lightweight, and comes with accessories to wear it on your wrist, neck, or clipped to your clothes.

One of the things I would have liked to see is a clearer indication that the device is recording. The small red light that turns on is easy to miss, especially when it’s placed on a desk. That said, the recording process is straightforward: press down on the center of the NotePin to start and stop recording, with some haptic feedback to confirm the action.

The Plaud App handles all of the file management and transcription. The device connects via Bluetooth, and while that’s functional, transferring files takes time—an hour-long recording might take five to ten minutes to fully transfer. There’s an option to switch to Wi-Fi mode to speed this up, but it’s not on by default. Once a recording is transferred, you can either keep it as an audio file or send it to the cloud for transcription and summarization.

I tested this at a recent school board meeting, where I was surprised at how well it picked up voices across a large room. After uploading the audio to the app, the transcription process was smooth.

It labels the speakers, but you need to manually assign names to the voices in each session. It unfortunately doesn’t retain the voice prints of speakers that have been identified in prior sessions, so speakers need to be labeled every time. The app doesn’t always differentiate between speakers accurately, especially when they’re far from the microphone, but overall, the transcription quality was impressive.

What I found most interesting was the summary feature. The app generates a concise breakdown of the meeting, highlighting key points and action items. You can also adjust the summary format based on the type of meeting. The summary was mostly accurate, though there were a few minor mistakes. But for anyone looking to quickly capture the essence of a discussion without diving deep into the details, I found it to be quite effective. The minutes can be exported into a number of popular formats like Word, PDF and Markdown.

Another useful feature is that you can upload audio from other sources into the app for transcription, meaning you’re not limited to recordings made on the NotePin itself.

If you don’t exceed the free five hours of transcription per month, you won’t need to pay anything extra, though that could change in the future. Many companies I’ve covered in the past have discovered that a robust set of free server-side features is hard to sustain over the long term.

If you’re in need of a quick, easy, and compact tool for turning meeting recordings into transcripts and summaries without much hassle, this could be a good fit. It’s not doing anything that you couldn’t do yourself with free transcription tools and services like ChatGPT, but I like the turnkey simplicity that Plaud has put together along with an elegant and simple piece of hardware.

Disclosure: Plaud.AI provided the NotePin to the channel free of charge. They did not review or approve this review before it was posted, and all of the opinions expressed are my own.

Run Your Own ChatGPT Alternatives with Chat with RTX and GPT4All

My latest video looks at ChatGPT alternatives that can be operated on personal computers, including PCs and Macs.

I first look at Nvidia’s Chat with RTX, a tool enabling users to run a ChatGPT-like chatbot locally. Chat with RTX only works with Nvidia’s newer 30 or 40 series GPUs, which could be a limitation for some users. I tested it on a Lenovo Legion 5 Pro (affiliate link) that had an RTX 4060 GPU on board. Disclosure: the laptop is on loan from Lenovo.

I then tried GPT4All, an alternative open-source large language model client that offers similar functionality to Chat with RTX but without the need for high-end GPU hardware. Like Chat with RTX, GPT4All is user-friendly, requiring minimal setup and no advanced developer tools. GPT4All is compatible with various operating systems, including Macs, Linux, and Windows, broadening its accessibility. However, for optimal performance, 16 GB of system RAM is recommended, especially on Windows.

In testing these platforms, I observed that while these AI models are capable, they are not nearly as good as ChatGPT. My test involved having the AIs summarize one of my prior video transcripts for a blog post. I found that they more often than not got the context of the video wrong and even made stuff up rather than adhering to the facts in the source text they were summarizing.

But this does show how fast AI technology is moving from large data centers into something that can be run locally on a laptop. I was particularly impressed with how fast and responsive GPT4All was on my M2 MacBook Air as compared to a Lenovo ThinkBook running with a 13th generation Intel processor.

Both chat clients allow the user to choose from a number of different large language models. Although I only looked at three of those models in the video, there are many more offered as a free download to explore. These models are being updated all the time so I’m sure we’ll see some rapid improvements as the year progresses.

ChatGPT Saves Me Time by Converting YouTube Transcripts to Blog Posts

I’ve been around for a while in the tech media space, so I’m always wary when the next new “shiny object” emerges on the scene. Google Glass, VR, crypto and NFTs were mega hyped by influencers only to fall way short when it came to mass consumer adoption.

Over the last several months the chattering influencer class has shifted focus almost entirely to artificial intelligence (AI) driven by the very rapid advancements in Large Language Model (LLM) chatbots like ChatGPT. I haven’t heard a peep about NFTs in months!

I approached this new technology with a healthy degree of skepticism. While it certainly has a “gee whiz” factor to it, could it actually have some real utility in my day-to-day life?

I decided to pony up the $20 monthly subscription fee for ChatGPT Plus to see if it could save me some time and make my workflow more efficient. And surprisingly – it did. You can learn more in my latest video.

I’ve been using ChatGPT to help write these blog posts based on the transcripts of my YouTube videos for the last month or two. Last week ChatGPT became even more useful through the introduction of plugins that allow ChatGPT to perform tasks that go beyond its pre-existing knowledge cutoff of September 2021.

One of the plugins I’ve been using is VoxScript, which can pull down full video transcripts from YouTube that ChatGPT can then use to produce summaries for this blog and my email newsletter.

Here’s how it works: I provide ChatGPT with the URL of my YouTube video and ask it to write a summary in the first person in a journalistic, neutral language style. ChatGPT uses VoxScript to pull down the full transcript from the video and starts writing the summary. The result is usually a well-written summary that captures the key points of the video, saving me about 30 minutes to an hour of writing time.
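Here’s a rough sketch of how a prompt like that could be templated; the wording below is illustrative rather than my verbatim prompt:

```python
def blog_post_prompt(video_url: str) -> str:
    """Assemble the kind of prompt described above (wording is illustrative)."""
    return (
        "Using the VoxScript plugin, pull the transcript of this YouTube video: "
        f"{video_url}\n"
        "Then write a first-person blog post summary in a journalistic, "
        "neutral language style, capturing the key points of the video."
    )

# Example (the URL here is a placeholder):
# print(blog_post_prompt("https://www.youtube.com/watch?v=VIDEO_ID"))
```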

The AI does an impressive job of interpreting the automatically generated YouTube transcripts, even correcting inaccuracies and presenting the information in a coherent manner.

Of course, it’s not perfect, and I do have to tweak some parts to ensure it aligns with my voice and style. But overall, it can generate anywhere from 75-90% of the post depending on what the topic is. This post, for example, needed a little more work done to it by yours truly but the framework it provided was a great time saver.

As AI technology continues to evolve, I’m excited to see how it can further enhance productivity and efficiency in various fields. And AI is more than just chatbots. For example, Tesla’s full self-driving system is an artificial intelligence neural network, trained to drive, that runs locally on their cars.

As always, I’m interested in hearing about your experiences with AI. If you’ve found a practical use for AI that has improved your workflow definitely head over to YouTube and share your experiences in the comments section of the video.

Plex Amp Sonic Sage Adds ChatGPT AI Music Recommendations

In my latest video I dive into the world of AI-powered music discovery with the Plex Amp player and its new “Sonic Sage” feature. Sonic Sage uses ChatGPT to deliver playlist recommendations.

Here’s how it works: Sonic Sage interfaces with OpenAI’s GPT model. To get it running, you’ll need an API key from the OpenAI platform. There is a small cost for using this key but I’ve found it to be minimal. So far I’ve only racked up about 5 cents of cost for well over 20 queries.
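For a rough sense of scale, that works out to about a quarter of a cent per query. A quick back-of-the-envelope sketch (the figures come from my own observed usage, not OpenAI’s published pricing):

```python
def avg_cost_per_query(total_cost_usd: float, query_count: int) -> float:
    """Average API cost per query, in dollars."""
    return total_cost_usd / query_count

def queries_per_dollar(total_cost_usd: float, query_count: int) -> float:
    """How many queries a dollar buys at the observed rate."""
    return 1.0 / avg_cost_per_query(total_cost_usd, query_count)

# At roughly $0.05 across 20 queries:
# avg_cost_per_query(0.05, 20)  -> 0.0025 (a quarter of a cent per playlist)
# queries_per_dollar(0.05, 20)  -> about 400
```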

Once you’ve enabled Sonic Sage, it lives right inside the search icon on your Plex Amp app. ChatGPT uses your queries to generate music recommendations. You can ask it for anything, from general genres to very specific prompts. For example, you could ask for “high energy, lesser-known female rockers from the last 20 years”, and Sonic Sage will whip up a playlist to match.

The AI’s recommendations are based on how you word your prompts. While it’s not perfect at always getting things right, it does a pretty solid job of delivering great music to match what you’re looking for. The only drawback I’ve noticed so far is that these AI-generated playlists can’t be saved, but I’m sure this could change in the future.

This feature works best with a very large personal library or with Tidal, a subscription music service that integrates with Plex and Plex Amp. Tidal costs $8.99 a month if you subscribe through Plex and delivers all of its music as CD quality lossless FLAC audio. I covered the Tidal integration in a previous video.

In my view, Sonic Sage adds an interesting new dimension to Plex Amp’s already awesome music discovery capabilities.