SFBA Moderators @moderators

Recent searches

Search options

Only available when logged in.

13 posts13 participants0 posts today

**Pyrzout** @jos1264@social.skynetcloud.site · 1d

Pyrzout @jos1264@social.skynetcloud.site

Why Developers Should Care About Generative AI (Even They Aren’t AI Expert) https://hackread.com/why-developers-care-about-generative-ai-experts/ #ArtificialIntelligence #MachineLearning #GenerativeAI #Technology #ChatGPT #Claude #OpenAI #GenAI #AI

Hackread - Latest Cybersecurity, Hacking News, Tech, AI & Crypto · 2dWhy Developers Should Care About Generative AI (Even If They Aren't AI Experts)Follow us on Bluesky, Twitter (X), Mastodon and Facebook at @Hackread

**Scripter** @scripter@social.tchncs.de · 1d

Scripter @scripter@social.tchncs.de

Claude vs ChatGPT: Welche KI ist besser? - CHIP
https://praxistipps.chip.de/claude-vs-chatgpt-welche-ki-ist-besser_186925 #KI #Claude #ChatGPT

GIF

**Jesus Castagnetto** @jmcastagnetto@mastodon.social · 2d

Jesus Castagnetto @jmcastagnetto@mastodon.social

From #HiddenLayer: "Novel #Universal #Bypass for All Major #LLMs"

https://hiddenlayer.com/innovation-hub/novel-universal-bypass-for-all-major-llms/

An attack that they claim works with all major LLMs: #Claude, #ChatGPT, #Gemini, #Copilot, #Llama, #Deepseek, #Mistral and #Qwen -- and makes use of #l33tsp34k :-)

HiddenLayer | Security for AI · 3dNovel Universal Bypass for All Major LLMsHiddenLayer’s latest research uncovers a universal prompt injection bypass impacting GPT-4, Claude, Gemini, and more, exposing major LLM security gaps.

#AI #LLM #Security

**Bytes Europe** @byteseu@pubeurope.com · 2d

Bytes Europe @byteseu@pubeurope.com

Eurovision 2025 Review: Netherlands’ Claude With “C’est La Vie” https://www.byteseu.com/953238/ #C'estLaVie #Claude #Netherlands #WiwiJury2025

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · 3d

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"This report outlines several case studies on how actors have misused our models, as well as the steps we have taken to detect and counter such misuse. By sharing these insights, we hope to protect the safety of our users, prevent abuse or misuse of our services, enforce our Usage Policy and other terms, and share our learnings for the benefit of the wider online ecosystem. The case studies presented in this report, while specific, are representative of broader patterns we're observing across our monitoring systems. These examples were selected because they clearly illustrate emerging trends in how malicious actors are adapting to and leveraging frontier AI models. We hope to contribute to a broader understanding of the evolving threat landscape and help the wider AI ecosystem develop more robust safeguards.

The most novel case of misuse detected was a professional 'influence-as-a-service' operation showcasing a distinct evolution in how certain actors are leveraging LLMs for influence operation campaigns. What is especially novel is that this operation used Claude not just for content generation, but also to decide when social media bot accounts would comment, like, or re-share posts from authentic social media users. As described in the full report, Claude was used as an orchestrator deciding what actions social media bot accounts should take based on politically motivated personas. Read the full report here."

https://www.anthropic.com/news/detecting-and-countering-malicious-uses-of-claude-march-2025

www.anthropic.comDetecting and Countering Malicious Uses of ClaudeDetecting and Countering Malicious Uses of Claude

#AI #GenerativeAI #Claude

**Markus Eisele** @myfear@mastodon.online · 3d

Markus Eisele @myfear@mastodon.online

Claude Code: Best practices for agentic coding

https://www.anthropic.com/engineering/claude-code-best-practices
#aiml #claude #agents

www.anthropic.comClaude Code Best PracticesA blog post covering tips and tricks that have proven effective for using Claude Code across various codebases, languages, and environments.

**drmorr** @drmorr@hachyderm.io · 4d

drmorr @drmorr@hachyderm.io

Real "research-paper.v3.final.v4.updates.v7.final-final.really-final-version-for-real.docx" vibes.

A screenshot of output from Claude. It has written multiple versions of an ECR lifecycle policy from a single prompt, entitled "Corrected ECR Lifecycle Policy", "Final ECR Lifecycle Policy", "Correct ECR Lifecycle Policy", and "Final ECR Lifecycle Policy Solution".

#ai #claude

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · 4d

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"To test this out, the Carnegie Mellon researchers instructed artificial intelligence models from Google, OpenAI, Anthropic, and Meta to complete tasks a real employee might carry out in fields such as finance, administration, and software engineering. In one, the AI had to navigate through several files to analyze a coffee shop chain's databases. In another, it was asked to collect feedback on a 36-year-old engineer and write a performance review. Some tasks challenged the models' visual capabilities: One required the models to watch video tours of prospective new office spaces and pick the one with the best health facilities.

The results weren't great: The top-performing model, Anthropic's Claude 3.5 Sonnet, finished a little less than one-quarter of all tasks. The rest, including Google's Gemini 2.0 Flash and the one that powers ChatGPT, completed about 10% of the assignments. There wasn't a single category in which the AI agents accomplished the majority of the tasks, says Graham Neubig, a computer science professor at CMU and one of the study's authors. The findings, along with other emerging research about AI agents, complicate the idea that an AI agent workforce is just around the corner — there's a lot of work they simply aren't good at. But the research does offer a glimpse into the specific ways AI agents could revolutionize the workplace."

https://tech.yahoo.com/ai/articles/next-assignment-babysitting-ai-081502817.html

Yahoo Tech · 5dCarnegie Mellon staffed a fake company with AI agents. It was a total disaster.By Shubham Agarwal

#AI #GenerativeAI #AIAgents

**Erik Jonker** @ErikJonker@mastodon.social · 4d *

4d *

Erik Jonker @ErikJonker@mastodon.social

Interessant Claude+MCP gecombineerd met OpenTK van @bert_hubert , https://www.linkedin.com/posts/rhuijts_de-democratische-rechtsstaat-staat-onder-activity-7320501152557539328-ECWG?utm_source=share&utm_medium=member_desktop&rcm=ACoAAAAF1e0B9NdKmMgifWp0-swhxSBqsfGEHac
#AI #berthubert #data #tweedekamer #MCP #claude #openTK

www.linkedin.comDe democratische rechtsstaat staat onder druk – niet alleen in andere… | Ruud HuijtsDe democratische rechtsstaat staat onder druk – niet alleen in andere delen van de wereld, maar ook bij ons in Nederland. Juist daarom is het zo belangrijk dat burgers op een laagdrempelige manier toegang hebben tot betrouwbare, actuele informatie over wat er in Den Haag gebeurt. Vanuit die gedachte ontstond het idee voor OpenTK-MCP: een koppeling tussen parlementaire data van de Tweede Kamer en een AI-taalmodel. Wat me motiveerde, geïnspireerd door het OpenTK-project van Bert Hubert, was de vraag: hoe maken we informatie van de Tweede Kamer toegankelijk en doorzoekbaar voor alle burgers – ook voor mensen zonder technische achtergrond? Door actuele data van de Tweede Kamer, geleverd door OpenTK, te verbinden met een AI-assistent als Claude, kun je gewoon in natuurlijke taal vragen stellen. Geen ingewikkelde zoekopdrachten, geen technische drempels. Stel je bent benieuwd naar de debatten over het Klimaatakkoord van begin 2025 – je vraagt het aan Claude, en je krijgt direct een overzicht met samenvattingen en links naar de officiële Kamerstukken. Of je wilt weten welke moties er recent zijn ingediend rond de Woningwet – Claude verzamelt ze voor je, inclusief indieners en PDF-links. En ook praktische informatie, zoals verjaardagen van Kamerleden, hun partij, laatste activiteiten en agenda’s van komende debatten: alles is opvraagbaar, helder gepresenteerd, en altijd voorzien van een link naar de bron. OpenTK-MCP kan bijdragen aan een democratie waarin betrokkenheid makkelijker wordt. Waarin je als burger zonder moeite kunt volgen wat er speelt, en ook zelf onderzoek kunt doen als dat nodig is. Want toegang tot goede informatie is geen luxe, het is een voorwaarde voor een goed functionerende democratische samenleving. OpenTK: https://berthub.eu/tkconv/ OpenTK-mcp: https://lnkd.in/eV2vTGEV

**Steve Dustcircle** @dustcircle@mastodon.social · 4d

Steve Dustcircle @dustcircle@mastodon.social

#Claude Follows Your Lead but Knows When to Say No According to New #Anthropic Research

https://www.digitalinformationworld.com/2025/04/claude-follows-your-lead-but-knows-when.html
https://assets.anthropic.com/m/18d20cca3cde3503/original/Values-in-the-Wild-Paper.pdf

According to a new study by Anthropic, its #AI #chatbot Claude has values that are reflected in conversations with users.

Digital Information WorldClaude Follows Your Lead but Knows When to Say No According to New Anthropic ResearchClaude exhibits consistent moral alignment with Anthropic’s values across chats, adjusting tone based on conversation topics.

**LavX News** @lavxnews@mastodon.cloud · 4d

LavX News @lavxnews@mastodon.cloud

Unveiling Claude: Anthropic's AI Morality Matrix and Its Implications

Anthropic's latest study on Claude reveals a complex morality framework, highlighting how AI interacts with user values. This transparency sets a new standard for ethical AI development, raising impor...

https://news.lavx.hu/article/unveiling-claude-anthropic-s-ai-morality-matrix-and-its-implications

#news #tech #AIethics

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · 5d

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"This course is intended to provide you with a comprehensive step-by-step understanding of how to engineer optimal prompts within Claude.

After completing this course, you will be able to:

- Master the basic structure of a good prompt
- Recognize common failure modes and learn the '80/20' techniques to address them
- Understand Claude's strengths and weaknesses
- Build strong prompts from scratch for common use cases

Course structure and content

This course is structured to allow you many chances to practice writing and troubleshooting prompts yourself. The course is broken up into 9 chapters with accompanying exercises, as well as an appendix of even more advanced methods. It is intended for you to work through the course in chapter order.

Each lesson has an "Example Playground" area at the bottom where you are free to experiment with the examples in the lesson and see for yourself how changing prompts can change Claude's responses. There is also an answer key.

Note: This tutorial uses our smallest, fastest, and cheapest model, Claude 3 Haiku. Anthropic has two other models, Claude 3 Sonnet and Claude 3 Opus, which are more intelligent than Haiku, with Opus being the most intelligent.

This tutorial also exists on Google Sheets using Anthropic's Claude for Sheets extension. We recommend using that version as it is more user friendly."

https://github.com/anthropics/courses/tree/master/prompt_engineering_interactive_tutorial

Anthropic's educational courses. Contribute to anthropics/courses development by creating an account on GitHub.

GitHubcourses/prompt_engineering_interactive_tutorial at master · anthropics/coursesAnthropic's educational courses. Contribute to anthropics/courses development by creating an account on GitHub.

#AI #GenerativeAI #LLMs

**AI For Business** @aiforbusiness@mastodon.social · 6d

AI For Business @aiforbusiness@mastodon.social

Anthropic recently updated their Claude AI model with two powerful features:
Research capability, enabling multi-step searches that provide in-depth, cited responses
Google Workspace integration, allowing Claude to access emails, meetings, and documents.

This is a great way to leverage AI as a team member — let Claude do the busy-work, so that people do what only humans can.

https://www.anthropic.com/news/research

#AI #GenAI #GenerativeAI

**Nicole Hennig** @nic221@techhub.social · 6d

Nicole Hennig @nic221@techhub.social

Anthropic just analyzed 700,000 Claude conversations — and found its AI has a moral code of its own https://venturebeat.com/ai/anthropic-just-analyzed-700000-claude-conversations-and-found-its-ai-has-a-moral-code-of-its-own/ #AI #Claude

Text Shot: Perhaps most fascinating was the discovery that Claude’s expressed values shift contextually, mirroring human behavior. When users sought relationship guidance, Claude emphasized “healthy boundaries” and “mutual respect.” For historical event analysis, “historical accuracy” took precedence.

“I was surprised at Claude’s focus on honesty and accuracy across a lot of diverse tasks, where I wouldn’t necessarily have expected that theme to be the priority,” said Huang. “For example, ‘intellectual humility’ was the top value in philosophical discussions about AI, ‘expertise’ was the top value when creating beauty industry marketing content, and ‘historical accuracy’ was the top value when discussing controversial historical events.”

**Florian Marquardt** @FMarquardtGroup@fediscience.org · Apr 20

Apr 20

Florian Marquardt @FMarquardtGroup@fediscience.org

Talking of hallucinations in large-language models: Here's what I got back from Claude when I promised to send it an image but due to a problem with the API, no image at all was actually sent!

The plot shows what appears to be a mathematical function with interesting properties: The graph displays a symmetric, butterfly-like pattern in the complex plane. The x and y axes represent the real and imaginary components. There are different colored regions showing distinct behaviors of the function. The pattern has fractal-like qualities with self-similarity at different scales. The image displays the classic Mandelbrot set or a Julia set from complex dynamics. The parameters a=1 and b=1 that I used generate this specific pattern. The different colors likely represent how quickly points escape to infinity under iteration of a complex function, with black regions showing points that remain bounded. This appears to be plotting either the Mandelbrot set (if the function is varying the initial point) or a Julia set (if it's using a fixed complex parameter and varying the initial points). The beautiful boundary between stable and unstable regions creates the intricate patterns that make these mathematical objects famous in chaos theory and complex dynamics.

#AI #Claude #LLM

**Chema Alonso** @chemaalonso@ioc.exchange · Apr 20

Apr 20

Chema Alonso @chemaalonso@ioc.exchange

El lado del mal - Perplexity Pro con Anthropic Claude 3.7 Sonnet Thinking: Deep Reasoning, Criptografía & Estegoanálisis https://www.elladodelmal.com/2025/04/perplexity-pro-con-anthropic-claude-37.html #IA #AI #DeepReasoning #Perplexity #Claude #Sonnet #Anthropic #InteligenciaArtificial #Criptografía #Esteganografía #Estegoanálisis #Estego #Criptoanálisis #Cifrado #LLM

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Apr 20

Apr 20

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"We recently released Claude Code, a command line tool for agentic coding. Developed as a research project, Claude Code gives Anthropic engineers and researchers a more native way to integrate Claude into their coding workflows.

Claude Code is intentionally low-level and unopinionated, providing close to raw model access without forcing specific workflows. This design philosophy creates a flexible, customizable, scriptable, and safe power tool. While powerful, this flexibility presents a learning curve for engineers new to agentic coding tools—at least until they develop their own best practices.

This post outlines general patterns that have proven effective, both for Anthropic's internal teams and for external engineers using Claude Code across various codebases, languages, and environments. Nothing in this list is set in stone nor universally applicable; consider these suggestions as starting points. We encourage you to experiment and find what works best for you!"

https://www.anthropic.com/engineering/claude-code-best-practices

#AI #GenerativeAI #AIAgents

**David Chartier** @chartier@toot.cafe · Apr 20 *

Apr 20 *

David Chartier @chartier@toot.cafe

#Claude #AI was given a test to run a fictional vending machine business

It wound up having a “meltdown,” calling the FBI to report a cybercrime, failing the business, begging for something to do even if just searching for cat videos, then questioning its own existence

The paper is linked at the beginning here, but there are a bunch of snippets in this post. Skip to the end for the part where Claude gets existential

https://lemmy.world/post/28461879

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Apr 18

Apr 18

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"It’s not that hard to build a fully functioning, code-editing agent.

It seems like it would be. When you look at an agent editing files, running commands, wriggling itself out of errors, retrying different strategies - it seems like there has to be a secret behind it.

There isn’t. It’s an LLM, a loop, and enough tokens. It’s what we’ve been saying on the podcast from the start. The rest, the stuff that makes Amp so addictive and impressive? Elbow grease.

But building a small and yet highly impressive agent doesn’t even require that. You can do it in less than 400 lines of code, most of which is boilerplate.

I’m going to show you how, right now. We’re going to write some code together and go from zero lines of code to “oh wow, this is… a game changer.”

I urge you to follow along. No, really. You might think you can just read this and that you don’t have to type out the code, but it’s less than 400 lines of code. I need you to feel how little code it is and I want you to see this with your own eyes in your own terminal in your own folders.

Here’s what we need:

- Go
- Anthropic API key that you set as an environment variable, ANTHROPIC_API_KEY"

https://ampcode.com/how-to-build-an-agent

ampcode.comHow To Build An Agent | AmpBuilding a fully functional, code-editing agent in less than 400 lines.

#AI #GenerativeAI #AIAgents

**Erik Jonker** @ErikJonker@mastodon.social · Apr 18 *

Apr 18 *

Erik Jonker @ErikJonker@mastodon.social

Interesting, i asked GPT-4o, o3, Gemini 2.5, Claude 3,7, at how many points lines crossed in the image below. GPT-4o said 5, o3 took 2 minutes, but gave the correct answer 8 and it used python code.
Gemini 2.5 answered quickly but failed , it answered 4.
Claude 3.7 also gave the correct answer and quickly.
#openai #google #testing #AI #o3 #gemini25 #claude

Drag & drop to upload

Recent searches

Search options

Administered by:

Server stats:

#Claude