AI tools don’t always provide the correct answers, so I often find myself cross-referencing multiple models to get a wider range of perspectives. Manually copy-pasting the same prompt into Claude, ChatGPT, and Gemini quickly gets tiresome.
The three main LLMs I use are Claude, ChatGPT, and Gemini. They all provide APIs that make this pretty easy to build an app.
Working with Claude Code, I built a small app that runs locally to ask all the LLMs the same question and have them discuss the answers and provide a consensus view. It’s similar to asking advice from a group chat of friends. Everything is stored locally on your computer.
My highly imaginative name for the app is llm-discussion.
It wasn’t too hard to build. Took a little time to set up the accounts correctly to get the API keys, but it wasn’t difficult. The whole thing is only about 325 lines of Python.
I asked all three about a couple of topics like vitamins and cosmology. The discussion and consensus surprised me with how deep the answers went. Also, they are exceedingly, painfully polite to each other.
The consensus includes the key points, what they agree upon, and most interestingly, what they don’t agree upon.
I put in a few options. You can choose the number of rounds of discussion and which LLMs you want included. Each round feeds the previous responses back to the models so they can critique or refine their answers.
The LLMs can be a bit verbose, so there’s a pulldown to choose concise, standard, or detailed answers.
You can save the discussions as well. All locally on your computer.
I use Windows but the code should run on macOS or Linux easily as the app is just basic Python scripting and Flask for the web UI. It would be easy to add other models like Deepseek, Llama, Mistral, or other API providers.
The tokens do cost money on Claude and ChatGPT, but it’s pennies. Gemini currently has a free API tier with a cap that I haven’t managed to hit yet.
Just another example of using Claude Code ‘to scratch that itch’ and make small things in my nerd life easier.
Last week I was scrolling TikTok, as one does, and saw this video by Sangeetha Bhatath, a software engineer. She was discussing that Andrej Karpathy had released the code for microGPT, an extremely simple version of the code used to train large language models. Karpathy is a co-founder of OpenAI and one of the leading thinkers in the space.
Sangeetha’s point is that you can try training a LLM yourself, and see what’s in the inside the black box to some degree. I was intrigued and decided to give it a try.
After a bit of chatting with Claude (the web chat AI from Anthropic), we agreed to use nanoGPT as it was able to take advantage of GPU processing. As a PC gamer, I have a reasonable video card (Nvidia 4070 Super w/12GB VRAM) that would greatly speed the training. GPUs do a lot of vector math to make video games work and coincidentally LLM training is basically the same kind of vector math. I hated linear algebra in engineering school, so I’m glad we have chips to do this for me.
The plan was to use the GPT-2 weights that are publicly available with as much data as I could gather of my own writing and speaking. In short, a plan to make a Cruftbot or CruftGPT. Claude made a detailed four phase plan that I could understand and was clear direction for Claude Code (Anthropic’s focused developer AI app) to execute.
The text you used to train a LLM is reflected in the way the LLM writes. Train a lot of Shakespeare, you get a LLM that talks like an Elizabethan. Train a lot of legal documents, you get a LLM that talks like a lawyer.
I’ve been in the interwebs for a long time and have 25 years of posting and over 300 videos of my various antics. Claude helped me write several scripts to scrape data from my weblog, Medium stories, Bluesky posts, and transcripts of my videos. Reddit has an export function, which made that easy. I have a lot of posts on Twitter, but I haven’t been posting there for a couple years now. It used to be easy to get an export of posts, but under the current management it’s extremely difficult.
I set Claude Code to work on setting up the NanoGPT code on my desktop. As an aside, wsl2 (Ubuntu linux) under Windows works very well. I fed the personal data to Claude Code and it formatted it for me. 25+ years on the internet equaling 699K tokens of data. Good, but not great.
Another aside: LLMs process text using tokens, which are the numerical building blocks of text input. Instead of reading full words, a tokenizer breaks text down into common chunks of characters. For example, the word ‘apple’ might be one token, while a complex word like ‘bioluminescence’ might be split into three or four tokens. The tokenizer assigns each unique chunk a specific number, the word ‘apple’ might be ‘27149’.
Training is essentially the LLM learning the mathematical relationships between these numbers. Since computers excel at math but don’t ‘read’ like humans, turning language into a giant game of statistics and geometry (technically it’s vector math) is what makes the magic happen.
Claude started a few training runs and tried both GPT-2 small (124M) and GPT-2 medium (345M) parameter sets to see what worked best with my personal dataset. After a bit of GPU time, it found the GPT-2 medium worked best to provide the best ‘val loss trajectory’. I learned that ‘val loss trajectory’ is tracking the validation loss number, which kinda means how well the personal data is overlaying with the base language data.
Since I want CruftBot to sound like me, it’s important the training results in my personal data being more apparent than the base language that the GPT-2 set provides.
Before bed, I told Claude to continue training and to continue without asking me for approval. The GPU was pegged at 99% but not overheating, which was great.
The next morning the training was done and Claude stood up Gradio to act as a UI with CruftBot.
The results were underwhelming.
The output used words I use, but was put together in nonsense fashion. You could see CruftBot trying, but it was just guessing at words.
Claude explained “This is the fundamental limitation of a fine-tuned model this size: it’s not a knowledge model or a chat assistant, it’s a text completion engine trained on your writing patterns. It doesn’t understand questions, it just continues text in a direction that statistically resembles your corpus.”
Claude went on to explain that what I really needed was a lot more tokens of my own data.
My own data means things I’ve written, talks I’ve given, and videos I’ve made. Asking for triple of what it took me 30 years on the internet to write, and I’m prolific compared to most netizens, is humbling. There just doesn’t exist three times more ‘me’ of data out there.
In short, I learned it’s just guessing words based on patterns of tokens in the data it was trained on and it needs a lot more data to train on. There is some truth to the idea that AIs are ‘word guessing machines’ but at the leading edge they guess as well as almost any expert human would on topics.
If I really wanted to take this further, there are other approaches to improve the result, but in the end they would all pale in comparison to the current frontier models that you can try for free.
There’s a huge value in doing technical things yourself and seeing what is involved. I learned a tremendous amount about the basics of LLM training and what kind of issues would be involved with scaling.
When I worked at NBC, we used the same Nvidia A100 & H200 cards for video editing that are now used for LLM training. They are enormously powerful GPUs. At the time, our competition in buying them was from cryptocurrency groups, not AI companies. The idea that thousands of these cards are needed to train the frontier AI modules shows me the gigantic amount of tokens that are crunched to get today’s AI bots.
Looking at this from a professional point of view, it’s easy to extrapolate from my experiment how a business might want to build its own LLM, trained on a large corpus of knowledge important to that business. It’s probably a spreadsheet of costs comparing doing it yourself with servers, GPUs, and data centers compared to paying an existing AI company to train your data on top of their models. On top of all that, does the cost of a well trained AI system pay for itself in terms of productivity and improvements? The answer on that is still undetermined, despite the current hype cycle.
We are all in the very early days of AI, despite the feeling that it’s taking over our personal worlds and most businesses. My 24-hour experiment only scratched the surface and it’s clear there’s a long way to go before any of us (developers, businesses, or society) truly understand how this technology will reshape our world.
If you are technically minded, do yourself the favor and try training your own model. It won’t end up being very usable, but you will learn a lot.
Today marks the 26th anniversary of starting this very weblog. I had a personal web site since 1997, but 2000 marks when I began traditional blogging.
My first post was celebrating getting a blogging system known as NewsPro running. In the beginning I was mainly blogging about Ultima Online, the MMO I was playing at the time.
The internet was very different at the time. There was no Facebook, Twitter, Instagram, or Wikipedia. There were no smartphones, streaming video, podcasts, or mainstream broadband. Storage was measured in megabytes, not gigabytes.
In January 2000, we were still in the era of dial-up internet, desktop computers, flip phones, DVDs, and broadcast television.
The change over the last 26 years is mind-blowing when you step back and look at it.
I changed over the years as well, going from the father of toddlers to an empty nester. My blogging evolved with me, from video games and daily routines to writing about the nascent social media and blogging scene.
Sometime in 2002 I moved from NewsPro to MovableType. At the time, the software was revolutionary.
Beyond this personal weblog, I was also experimenting in the corporate world, getting my maintenance team to write about what they did on their shifts online as opposed to paper logbooks.
I remember sending a check for $200 to Ben & Mena Trott to license the Walt Disney Company to use Movable Type. I spoke at conferences about using blogging in the workplace.
When digital cameras became affordable I was able to incorporate more images and even post video.
In those days, comments were de rigueur on weblogs and the first appearance of spam and bad actors arrived, and the endless attempts to counter them were met with varying levels of success. At the time, the dopamine from nice comments outweighed the headache of spammers.
Places to share on Reddit, Discords, and Slacks were abundant. But you can be at the mercy of moderators of varying attitudes and commenters that try to make you feel bad.
As social media became a slurry of AI slop, influencers, and bots, I realized I needed a space I could actually control, one not beholden to CEOs chasing the latest hype cycle.
In 2024, thanks to help from my friend Greg, we got this blog up and running again. Greg helped me move to WordPress, which is the de facto standard these days.
I still make videos and post them here, but also make actual blog posts about things that interest me.
I’m not trying to make money or become a dadfluencer, just happy to have a little space on the net for myself.
There is no amazing revelation or realization after 26 years of blogging.
I have no idea if people are reading what I write, and it really doesn’t matter.
It gets the ideas out of the whirlwind in my head so I can make space for new things.
I’m just happy to keep my little corner of the interwebs tidy.
Twenty-six years on, I’m still writing, not because it’s strategic or visible, but because I enjoy it.
As holiday cards began to roll in, Michele, my wife, and I discussed whether we were going to do a Christmas card ourselves. After being married 30+ years with our kids grown and out of the house, our life doesn’t lead to photos of far-flung travel and excitement.
Recently, our neighbor Emily put out a zine featuring my 20 years of Halloween costume data. I was honored and thrilled to see an actual paper zine. For those that don’t know, a zine is a small self-published booklet often made by hand.
When I saw it, I had thoughts about what I would put into a zine.
We considered various card concepts until the idea clicked: make our own zine. Not a holiday themed one, but just filled with work and ideas from both of us.
We started filling up the pages with small bits we thought might be interesting to the people on our Christmas card list. Some images of Michele’s work, several best of 2025 lists, a short essay about buying a smart TV, and a few things we had no idea if others would find interesting.
Once we finalized the content and design, I looked into printing. A few pages have color images and I found out quickly that color printing is still pretty expensive at local print shops. At over $2 in printing costs per issue, the total was adding up quickly to a big number. We decided to double down and buy our own color laser printer, as we’d probably be making zines again, not just for the holidays.
I felt very DIY as I folded and stapled the issues together. The physical work of putting a zine together is far more rewarding than uploading images to a print company on the internet. Michele helped with the card list and labels. We got a lot of feedback from friends and family and they loved it. One couple told us they’re going to make one themselves.
If you’d like a copy, you can print it from this PDF.
When printing, choose two-sided printing and flip on the short edge.
We had a blast making a zine. If you have some spare time, you might consider making one yourself.
I’ve been trying to be more consistent with alt text, but let’s be honest, writing it manually can be a chore.
Most AI chatbots do a good job of generating the alt text for me if I give them an image. They often catch details I overlook and add in information that I, as a human, would typically leave out.
To make it simpler, I wanted to right-click any image to automate getting AI-generated alt text.
It took about 15 minutes to build.
I used Claude Code to make a small Python script to send the image to the Claude API and return the result to a local web page to copy the text easily.
It was strikingly simple to do this. The script operates on Windows 11 using the Claude API.
To use it, you need to run Python, a few dependencies, and make a few registry edits to add the right-click menu option. Claude Code did this for me directly, but it can be done manually.