• WKND AI
  • Posts
  • Is Claude The New ChatGPT Killer?

Is Claude The New ChatGPT Killer?

+ JPMorgan's AI Cuts Work By 90% and The US Army Plays AI War Games

Hello WKND AI Warriors!

Anthropic rolls out Claude 3, and you guessed it, thereโ€™s THREE models: Haiku, Sonnet, and Opus.

Also, JPMorganโ€™s AI cuts work by 90%.

Plus, Biden calls out AI during his State Of The Union Address.

Oh yeah, and the US Army is playing war games with AI. Are we ready for the future of warfare?

So, grab your beverage of choice.

Hereโ€™s your weekly dose of AI news.

Todayโ€™s newsletter includes:

  • ๐Ÿ“ฐ AI NEWS RECAP

  • ๐Ÿคฟ AI DEEP DIVE

  • ๐Ÿ› ๏ธ AI TOOL OF THE WEEK

  • โš™๏ธ GPT OF THE WEEK

  • ๐Ÿ“ AI PROMPT OF THE WEEK

  • ๐ŸŽจ AI IMAGE OF THE WEEK

๐Ÿ“ฐ AI NEWS RECAP

Is Claude 3 the new ChatGPT Killer?

The answer might surprise you.

But first...who the heck is Claude?

Claude is a large language model created by Anthropic.

That doesn't help...what the heck is an Anthropic?

It's a company that was founded by former OpenAI employees โ€”

Who weren't happy with the direction the company was going.

So they setoff to create their own LLM.

While others rushed AI capabilities,

Anthropic pumped the brakes.

Insisting on rigorous safety constraints.

And so became Claude, their answer to ChatGPT.

๐˜š๐˜ฐ, ๐˜ธ๐˜ฉ๐˜บ ๐˜ช๐˜ด ๐˜ฆ๐˜ท๐˜ฆ๐˜ณ๐˜บ๐˜ฐ๐˜ฏ๐˜ฆ ๐˜ง๐˜ณ๐˜ฆ๐˜ข๐˜ฌ๐˜ช๐˜ฏ๐˜จ ๐˜ฐ๐˜ถ๐˜ต ๐˜ข๐˜ฃ๐˜ฐ๐˜ถ๐˜ต ๐˜Š๐˜ญ๐˜ข๐˜ถ๐˜ฅ๐˜ฆ 3?

โœ… ๐—œ๐˜ ๐—ฆ๐— ๐—”๐—ฆ๐—›๐—˜๐—ฆ ๐—ฏ๐—ฒ๐—ป๐—ฐ๐—ต๐—บ๐—ฎ๐—ฟ๐—ธ ๐˜€๐—ฐ๐—ผ๐—ฟ๐—ฒ๐˜€ ๐—ฎ๐—ฐ๐—ฟ๐—ผ๐˜€๐˜€ ๐˜๐—ต๐—ฒ ๐—ฏ๐—ผ๐—ฎ๐—ฟ๐—ฑ!

Coding, analytics, reasoning - you name it.

โœ… ๐—œ๐˜ ๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€ ๐—ถ๐—ป ๐—บ๐˜‚๐—น๐˜๐—ถ๐—ฝ๐—น๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น ๐˜€๐—ถ๐˜‡๐—ฒ๐˜€ ๐—ณ๐—ผ๐—ฟ ๐—ฒ๐˜ƒ๐—ฒ๐—ฟ๐˜† ๐—ป๐—ฒ๐—ฒ๐—ฑ.

๐—›๐—ฎ๐—ถ๐—ธ๐˜‚: Lightweight, budget-friendly

โ†ณ Think customer service chatbots

๐—ฆ๐—ผ๐—ป๐—ป๐—ฒ๐˜: Balance of speed & capability

โ†ณ Think code generation

๐—ข๐—ฝ๐˜‚๐˜€: Cutting-edge, high-powered

โ†ณ Think financial modeling

Allowing companies to level up based on their needs.

โœ… ๐—œ๐˜ ๐—ต๐—ฎ๐˜€ ๐Ÿฎ๐Ÿฌ๐Ÿฌ๐—ž ๐—ฐ๐—ผ๐—ป๐˜๐—ฒ๐˜…๐˜ ๐˜„๐—ถ๐—ป๐—ฑ๐—ผ๐˜„๐˜€ ๐˜„๐—ถ๐˜๐—ต ๐—ป๐—ฒ๐—ฎ๐—ฟ ๐—ฝ๐—ฒ๐—ฟ๐—ณ๐—ฒ๐—ฐ๐˜ ๐—ฟ๐—ฒ๐—ฐ๐—ฎ๐—น๐—น.

Larger context windows means more data you can input.

But more important โ€”

It can find the "๐—ก๐—ฒ๐—ฒ๐—ฑ๐—น๐—ฒ ๐—œ๐—ป ๐—” ๐—›๐—ฎ๐˜†๐˜€๐˜๐—ฎ๐—ฐ๐—ธ" (NIAH).

๐˜ž๐˜ข๐˜ช๐˜ต...๐˜ธ๐˜ฉ๐˜ข๐˜ต ๐˜ฅ๐˜ฐ ๐˜ฏ๐˜ฆ๐˜ฆ๐˜ฅ๐˜ญ๐˜ฆ๐˜ด ๐˜ช๐˜ฏ ๐˜ข ๐˜ฉ๐˜ข๐˜บ๐˜ด๐˜ต๐˜ข๐˜ค๐˜ฌ ๐˜ฉ๐˜ข๐˜ท๐˜ฆ ๐˜ต๐˜ฐ ๐˜ฅ๐˜ฐ ๐˜ธ๐˜ช๐˜ต๐˜ฉ ๐˜ˆ๐˜?

Pioneered by AI researcher Gary Marcus,

NIAH is where large amounts of text are given to an LLM.

๐—ง๐—ต๐—ฒ ๐—ต๐—ฎ๐˜†๐˜€๐˜๐—ฎ๐—ฐ๐—ธ.

With a random sentence placed in the middle of the text.

๐—ง๐—ต๐—ฒ ๐—ป๐—ฒ๐—ฒ๐—ฑ๐—น๐—ฒ.

Then the model is asked question about the random sentence.

Historically models do well with the beginning and end of text.

But fail and hallucinate with text in the middle.

Well, Claude passes the NIAH test with flying colors!

โœ… ๐—œ๐˜ ๐—ต๐—ฎ๐˜€ ๐—ณ๐—ฒ๐˜„๐—ฒ๐—ฟ ๐—ฟ๐—ฒ๐—ณ๐˜‚๐˜€๐—ฎ๐—น๐˜€, ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—ฎ๐—ฐ๐—ฐ๐˜‚๐—ฟ๐—ฎ๐—ฐ๐˜†

A lingering complaint with prior Claude models,

Was unnecessary refusal to answer queries.

Claude 3 fixed that and has more factually accurate responses.

(If you're still reading this you found the ๐Ÿชก in my post)

My Take:

Safe to say Claude is a contender.

The benchmarks show Claude beats ChatGPT across the board โ€”

And numbers don't lie.

I personally use Claude 30% of the time, and ChatGPT the rest.

So, do you think Claude is the ChatGPT killer?

Time will tell.

Claude may have won the battle today.

BUT rumor has it OpenAI already solved AGI...last year.

I'll save that topic for another day ๐Ÿ˜‰

US Army Plays War Games With AI. The US Army Research Laboratory is experimenting with generative AI chatbots like OpenAI's GPT-4 Turbo and Vision models for strategic planning in military simulations, raising both tactical possibilities and ethical concerns.

Inflection AI Unveils Its 2.5 Model. Inflection AI launches Inflection-2.5, combining high IQ with empathetic interaction, rivaling top models like GPT-4 with less compute, enhancing user engagement and information access across platforms.

State Of The Union Addresses AI. President Biden calls for a ban on AI voice impersonations during the State of the Union, addressing concerns in entertainment and tech, and reflecting SAG-AFTRA's negotiations on AI use in media.

AI Detects Kidney Failure Faster. An AI tool developed by Sheffield Teaching Hospitals NHS Foundation Trust predicts kidney failure six times faster than human experts, significantly improving diagnostics for polycystic kidney disease.

OpenAI's New Board Of Directors. OpenAI welcomes Dr. Sue Desmond-Hellmann, Nicole Seligman, and Fidji Simo to its board, enhancing governance with their diverse expertise in global leadership and technology. Sam Altman also rejoins the board.

OpenAI's Ousting Review Complete. OpenAI confirms Sam Altman and Greg Brockman will continue leading the organization after a comprehensive review. The board introduces new governance enhancements and welcomes three new members, reinforcing its commitment to responsible AI development.

OpenAI Claps Back At Elon's Lawsuit. OpenAI addresses its history with Elon Musk, emphasizing commitment to AGI for humanity's benefit. Despite Musk's departure and legal disputes, OpenAI continues its mission, distancing from Musk's approach and focusing on broad AI benefits.

Will Perplexity AI Be The Next Unicorn? Perplexity AI is finalizing a funding round that will elevate its valuation to around $1 billion, marking a significant leap from its previous $520 million valuation, showcasing the ongoing investor confidence in AI technologies.

Microsoft Dusts Off VCRs For Its Legal Defense. Microsoft references VCR technology in its defense against The New York Times' copyright lawsuit, arguing that large language models, like OpenAI's, should not be hindered by copyright law, similar to past technologies.

JPMorgan's AI Cuts Work By 90%. JPMorgan's AI-driven tool, Cash Flow Intelligence, has reduced manual work for corporate clients by nearly 90%, showcasing the bank's commitment to leveraging AI for efficiency and forecasting accuracy.

Meta's AI To Control Your Video Feed. Meta is developing a single AI model to enhance video recommendations across Facebook, aiming to increase user engagement by promoting more relevant and captivating content, such as Reels, to keep users on the platform longer.

Google AI Engineer Steals AI Secrets. Linwei Ding, a Google engineer, faces indictment for allegedly stealing AI trade secrets related to Google's TPU chips and transferring them to Chinese companies, highlighting the intensifying global AI technology race.

๐Ÿคฟ AI DEEP DIVE

We talked about the new Claude 3 in todayโ€™s feature story.

Get an in-depth look at Claude 3 and how it stacks up against ChatGPT.

This video puts both models to the test across common use cases, assessing whether Claude 3โ€™s capabilities offer enough to consider switching from the well-established GPT-4 for certain types of tasks.

๐Ÿ› ๏ธ AI TOOL OF THE WEEK

Parallel AI: Revolutionize your business with Parallel AI, a platform that crafts AI employees tailored to your company's unique data and operational needs.

These virtual specialists boost efficiency by conducting research, providing consultations, and integrating seamlessly with tools like Slack, Google Docs, and Notion, all while ensuring top-tier data privacy and security.

Send your tool here to be featured next week!

โš™๏ธ GPT OF THE WEEK

Gauntlet Movies: Are you a movie buff and think you know your stuff?

Give this movie trivia GPT a try. Itโ€™s sure to challenge you!

๐Ÿ“ AI PROMPT OF THE WEEK

Copy and paste this into your favorite chatbot.

Debate the nature of consciousness with a panel of AI philosophers from various systems and frameworks.

๐ŸŽจ AI IMAGE OF THE WEEK

Copy and paste this into your favorite image generator.

Image by armandofalcao on Midjourney.

An image of a man divided in two. One side of his face, in black and white, is a version of him as a six-year-old child while the other side is in color and an adult version of him as a 60-year-old.

Not paying for Midjourney or DALL-E 3?
Click here for Microsoftโ€™s FREE image creator.

Send your image here to be featured next week!

LAST WEEK FROM OUR READERS

Last weekโ€™s image by WKND AI reader d_kumar โ€œunicorns & rainbowsโ€

HOW CAN YOU HELP?

Did you learn something cool today?

Share your favorite takeaway on your LinkedIn from todayโ€™s newsletter and tag me for a little surprise!

Connect with me on LinkedIn.

How'd you like this newsletter?

Love it or hate it? Let us know why!

Login or Subscribe to participate in polls.

Share your feedback to make WKND AI better for YOU.

Refer our newsletter to a friend, co-worker, or family member.

Advertise in our newsletter to reach readers excited about AI.

MISSED LAST WEEKโ€™S EDITION?