digitalcourage.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Diese Instanz wird betrieben von Digitalcourage e.V. für die Allgemeinheit. Damit wir das nachhaltig tun können, erheben wir einen jährlichen Vorausbeitrag von 1€/Monat per SEPA-Lastschrifteinzug.

Server stats:

854
active users

#chatbots

13 posts12 participants1 post today

#statusai #ai #chatbots

Just heard about this app/site today. Status AI is a social media app, but every interaction you get, good ahd bad, is from digital sentients (ai chatbots) instead of analog secretors (us/humans). What a boon for hikikimori all around the world! I’m definitely curious.

#CommunityContent

Wenn generative #Chatbots mit #Internetsuche falsche #Quellen zitieren und sich dabei sicher geben, wird es gefährlich. Eine Untersuchung des Tow Center for Digital Journalism hat acht KI-Suchmaschinen unter die Lupe genommen und geschaut, wie gut die KI-Syseme im Umgang mit Originalartikeln abschneiden.

#KI #Fehlinformation #Quellenangaben #Medienkompetenz #DigitaleRecherche #Faktenprüfung #Credits

tino-eberl.de/uncategorized/ki

Tino Eberl · KI-Suchmaschinen im Faktencheck: 60 % der Zitate sind falsch
More from Tino Eberl

Researchers claim breakthrough in fight against AI’s frustrating #security hole

In the #AI world, a #vulnerability called "prompt injection" has haunted developers since #chatbots went mainstream in 2022. Despite numerous attempts to solve this fundamental vulnerability—the digital equivalent of whispering secret instructions to override a system's intended behavior—no one has found a reliable solution. Until now, perhaps.
#promptinjection

arstechnica.com/information-te

Illustration graphic of beautiful texture or pattern formation on the camel body shape, isolated on black background.
Ars Technica · Researchers claim breakthrough in fight against AI’s frustrating security holeBy Benj Edwards

Sex-Fantasy #Chatbots Are Leaking a Constant Stream of #Explicit Messages

Some misconfigured #AI chatbots are pushing people’s #chats to the open web—revealing #sexual prompts and conversations that include descriptions of child sexual abuse.
#privacy #security #leak

wired.com/story/sex-fantasy-ch

WIRED · Sex-Fantasy Chatbots Are Leaking a Constant Stream of Explicit MessagesBy Matt Burgess

This tells us a lot about how the lives of an increasing number of human beings are so empty of social contact with other human beings that they need to enter into false relationships with chatbots governed by neural networks and statistical probabilities...

"More and more of us are using LLMs to find purpose and improve ourselves.

Therapy and Companionship is now the #1 use case. This use case refers to two distinct but related use cases. Therapy involves structured support and guidance to process psychological challenges, while companionship encompasses ongoing social and emotional connection, sometimes with a romantic dimension. I grouped these together last year and this year because both fulfill a fundamental human need for emotional connection and support.

Many posters talked about how therapy with an AI model was helping them process grief or trauma. Three advantages to AI-based therapy came across clearly: It’s available 24/7, it’s relatively inexpensive (even free to use in some cases), and it comes without the prospect of judgment from another human being. The AI-as-therapy phenomenon has also been noticed in China. And although the debate about the full potential of computerized therapy is ongoing, recent research offers a reassuring perspective—that AI-delivered therapeutic interventions have reached a level of sophistication such that they’re indistinguishable from human-written therapeutic responses.

A growing number of professional services are now being partially delivered by generative AI—from therapy and medical advice to legal counsel, tax guidance, and software development."

hbr.org/2025/04/how-people-are

Harvard Business Review · How People Are Really Using Gen AI in 2025Last year, HBR published a piece on how people are using gen AI. Much has happened over the past 12 months. We now have Custom GPTs—AI tailored for narrower sets of requirements. New kids are on the block, such as DeepSeek and Grok, providing more competition and choice. Millions of ears pricked up as Google debuted their podcast generator, NotebookLM. OpenAI launched many new models (now along with the promise to consolidate them all into one unified interface). Chain-of-thought reasoning, whereby AI sacrifices speed for depth and better answers, came into play. Voice commands now enable more and different interactions, for example, to allow us to use gen AI while driving. And costs have substantially reduced with access broadened over the past twelve hectic months. With all of these changes, we’ve decided to do an updated version of the article based on data from the past year. Here’s what the data shows about how people are using gen AI now.

"If you’re new to prompt injection attacks the very short version is this: what happens if someone emails my LLM-driven assistant (or “agent” if you like) and tells it to forward all of my emails to a third party?
(...)
The original sin of LLMs that makes them vulnerable to this is when trusted prompts from the user and untrusted text from emails/web pages/etc are concatenated together into the same token stream. I called it “prompt injection” because it’s the same anti-pattern as SQL injection.

Sadly, there is no known reliable way to have an LLM follow instructions in one category of text while safely applying those instructions to another category of text.

That’s where CaMeL comes in.

The new DeepMind paper introduces a system called CaMeL (short for CApabilities for MachinE Learning). The goal of CaMeL is to safely take a prompt like “Send Bob the document he requested in our last meeting” and execute it, taking into account the risk that there might be malicious instructions somewhere in the context that attempt to over-ride the user’s intent.

It works by taking a command from a user, converting that into a sequence of steps in a Python-like programming language, then checking the inputs and outputs of each step to make absolutely sure the data involved is only being passed on to the right places."

simonwillison.net/2025/Apr/11/

Simon Willison’s WeblogCaMeL offers a promising new direction for mitigating prompt injection attacksIn the two and a half years that we’ve been talking about prompt injection attacks I’ve seen alarmingly little progress towards a robust solution. The new paper Defeating Prompt Injections …

"Finally, AI can fact-check itself. One large language model-based chatbot can now trace its outputs to the exact original data sources that informed them.

Developed by the Allen Institute for Artificial Intelligence (Ai2), OLMoTrace, a new feature in the Ai2 Playground, pinpoints data sources behind text responses from any model in the OLMo (Open Language Model) project.

OLMoTrace identifies the exact pre-training document behind a response — including full, direct quote matches. It also provides source links. To do so, the underlying technology uses a process called “exact-match search” or “string matching.”

“We introduced OLMoTrace to help people understand why LLMs say the things they do from the lens of their training data,” Jiacheng Liu, a University of Washington Ph.D. candidate and Ai2 researcher, told The New Stack.

“By showing that a lot of things generated by LLMs are traceable back to their training data, we are opening up the black boxes of how LLMs work, increasing transparency and our trust in them,” he added.

To date, no other chatbot on the market provides the ability to trace a model’s response back to specific sources used within its training data. This makes the news a big stride for AI visibility and transparency."

thenewstack.io/llms-can-now-tr

The New Stack · Breakthrough: LLM Traces Outputs to Specific Training DataAi2’s OLMoTrace uses string matching to reveal the exact sources behind chatbot responses

"When thinking about a large language model input and output, a text prompt (sometimes accompanied by other modalities such as image prompts) is the input the model uses to predict a specific output. You don’t need to be a data scientist or a machine learning engineer – everyone can write a prompt. However, crafting the most effective prompt can be complicated. Many aspects of your prompt affect its efficacy: the model you use, the model’s training data, the model configurations, your word-choice, style and tone, structure, and context all matters. Therefore, prompt engineering is an iterative process. Inadequate prompts can lead to ambiguous, inaccurate responses, and can hinder the model’s ability to provide meaningful output.

When you chat with the Gemini chatbot, you basically write prompts, however this whitepaper focuses on writing prompts for the Gemini model within Vertex AI or by using the API, because by prompting the model directly you will have access to the configuration such as temperature etc.

This whitepaper discusses prompt engineering in detail. We will look into the various prompting techniques to help you getting started and share tips and best practices to become a prompting expert. We will also discuss some of the challenges you can face while crafting prompts."

kaggle.com/whitepaper-prompt-e

www.kaggle.comPrompt Engineering

A hot topic in #GLAMR -- and this paper makes a good fist of understanding AI's limitations. Large differentials between #ChatGPT, and #CoPilot / #Gemini.

"this [study] underscores the continued importance of human labor in subject #cataloging work [...] #AI tools may prove more valuable in assisting catalogers especially in subject heading assignment, but continued testing & assessment will be needed..."

AI #Chatbots & Subject #Cataloging: A Performance Test doi.org/10.5860/lrts.69n2.8440 #metadata

PYOK: The British Airways Customer Service Chatbot is So Bad It Doesn’t Even Know Where The Airline is Based. “The conversation started with a fairly simple question as the chatbot asked Paddy to tell it where he was flying. The chatbot then suggested that Paddy either type the city or airport code – such as London or LHR for London Heathrow. Paddy replied with LHR, but having just given […]

https://rbfirehose.com/2025/04/08/pyok-the-british-airways-customer-service-chatbot-is-so-bad-it-doesnt-even-know-where-the-airline-is-based/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · PYOK: The British Airways Customer Service Chatbot is So Bad It Doesn’t Even Know Where The Airline is Based | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose