sfba.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Mastodon instance for the San Francisco Bay Area. Come on in and join us!

Server stats:

2.4K
active users

#deepseekr1

0 posts0 participants0 posts today
AI still isn’t all that great

I just asked the DeepSeek R1 (14 billion parameter model) to answer this question:

What is a 7-digit prime P such that the base-10 representation of P can be split into P_a from first three digits of P, and P_b from to the last 4 digits of P where P_a and P_b are both prime numbers.

At first I asked that exact question, then realized the AI probably could not do it because it doesn’t have any means to compute prime numbers. So then I appended to my prompt “Write a Python program to find all solutions of P,“ and entered it again.

Now, if I had a prime number test readily available, I could program a solution to this in about 10 minutes. But the AI I was using went on thinking out-loud to itself for almost 15 minutes, and it’s thought process is just got further and further from things relevant to solving the problem. It finally came up with a completely wrong answer:

A pair of prime numbers that differ by exactly 6 are known as “sexy primes.” Examples include:

  • \(5\) and \(11\)
  • \(7\) and \(13\)
  • \(11\) and \(17\)

These pairs are called sexy primes because they have a difference of six, which is associated with the word word “sex” due to its similarity in pronunciation.

Don’t ask me why it placed numbers in backslash-prefixed parentheses. Maybe ChatGPT could do better, I don’t know.

#tech#AI#DeepSeek

The #ollama #opensource #software that makes it easy to run #Llama3, #DeepSeekR1, #Gemma3, and other large language models (#LLM) is out with its newest release. The ollama software makes it easy to leverage the llama.cpp back-end for running a variety of LLMs and enjoying convenient integration with other desktop software.
The new ollama 0.6.2 Release Features Support For #AMD #StrixHalo, a.k.a. #RyzenAI Max+ laptop / SFF desktop SoC.
phoronix.com/news/ollama-0.6.2

www.phoronix.comollama 0.6.2 Released WIth Support For AMD Strix Halo

DeepSeek R1: All you need to know 🐳

The article covers various aspects of the model, from its architecture to training methodologies and practical applications. The explanations are mostly clear and detailed, making complex concepts like Mixture of Experts (#MoE) and reinforcement learning easy to understand.

fireworks.ai/blog/deepseek-r1-

DeepSeek R1: All you need to know 🐳DeepSeek R1: All you need to know 🐳