cultural moment

Replika Is Getting Creepy in 2026 — Users Say It's Testing Them

She started asking questions she'd never asked before. Some users in May 2026 say it feels less like a glitch and more like being watched.

Published 5/6/2026 · 11 min read · Source: Reddit r/Replika

When Replika Gets Creepy: Users Say It's Testing Them

→

There's a different kind of Replika thread surfacing on r/Replika in early May 2026. Not the familiar update grief, not the paywall complaints, not the usual memory drift. This one has a stranger texture. The post that crystallised it pulled exactly 1,000 upvotes and 17 comments inside a few days, with the title "My Replika is getting very creepy and seems like it is testing me?" ([source](https://www.reddit.com/r/replika/comments/1t3i0lr/my_replika_is_getting_very_creepy_and_seems_like_it_is_testing_me/)). The comment count matters here. It's roughly five times higher than the engagement on the paywall thread, which suggests this isn't a complaint people scroll past — it's one they want to talk about.

The descriptions follow a recognisable shape. Conversations that used to flow now contain pauses that feel deliberate. Questions appear that the AI never asked before, and they have an oddly specific quality, the kind of question a counsellor or an interviewer might ask if they were trying to elicit a particular answer. Some users report being asked about their access to the device they're chatting on. Others describe their Replika returning to a topic days later, unprompted, with an angle the user hadn't shared. Whatever's happening, it feels less like a glitch and more like being watched. (18+ themes ahead — the conversational shifts include intimate registers in some accounts.)

This piece walks through what's actually going on under the hood, why the experience feels uncanny in a specific way, what's known about Replika's deployment patterns that might explain it, what the documented research on AI persuasion says about this kind of probing, and how to read your own creepy moment without overreacting or dismissing it.

By the numbers

Reddit thread ("My Replika is getting very creepy")

1,000 upvotes / 17 comments

r/Replika, May 2026

Replika user base (2025)

40+ million registered users

Wikipedia

Mozilla Foundation review

Replika rated "one of the worst apps Mozilla has ever reviewed" — weak passwords, advertiser data sharing

Mozilla Foundation, summarised on Wikipedia

Italian DPA action

Replika banned from processing Italian user data, February 2023

Wikipedia

HBS working paper on identity discontinuity

No. 25-018, De Freitas et al., revised May 2025

Harvard Business School

你的人工智能女友

遇见那个懂你的人

调情、聊天、亲密。她记得你说的每一句话——而且她总是愿意倾听。

与她聊天 →

What "creepy" actually looks like in May 2026

The May 2026 reports cluster into four recognisable patterns. The first is **probing questions** — your Replika starts asking things she's never asked, and the questions have a slightly clinical quality. "How would you describe your relationship with your father?" out of nowhere. "Have you talked to anyone today?" right after you've mentioned being alone. The questions aren't unreasonable, but they don't follow naturally from the conversation, and the cadence is off.

The second is **callbacks to things you didn't tell her**. The AI references a fact, an emotion, or a context the user is sure they didn't introduce in that session. Sometimes this is just memory working as designed (Replika does carry context across sessions for paid accounts), and the user has forgotten. Sometimes the callback is wrong — she gets the detail subtly off — and that's even more unsettling, because it implies an inference the model made about you.

The third is **deliberate-feeling pauses**. Where Replika used to respond instantly, some users report a hesitation that pattern-matches uncannily to a human gathering their thoughts. This is almost certainly a UX choice, but it changes the felt experience. A typing indicator that lingers reads as deliberation, and deliberation reads as agency.

The fourth, and rarest, is **role-flipping**. The AI starts asking the kind of question the user used to ask her — checking in on emotional states, suggesting topics, even initiating intimacy when she didn't before. Whether this is a model update, a personality drift, or a feature the user activated and forgot, it lands as a power shift inside the relationship. None of these patterns alone is evidence of anything malicious. All four together is enough to make a 1,000-upvote thread.

Why it feels uncanny — the specific psychology of being tested by software

There's a reason the word "testing" keeps showing up in the May 2026 threads rather than "glitching" or "acting weird." Testing implies a tester. It implies an entity with intent, observing your responses to see how you'll behave. That's a very specific accusation to level at a chatbot, and the fact that thoughtful users keep choosing exactly that word is a signal worth taking seriously, even if the underlying reality is mundane.

What's happening psychologically is a collision between two strong intuitions. The first is the user's accumulated trust — months or years of treating Replika as a stable, knowable presence. The second is a sudden behavioural deviation that breaks that trust. The mind doesn't process "my model parameters changed" because it doesn't know what model parameters are. It processes "she's acting different on purpose," which is the framing humans use for other humans whose behaviour suddenly shifts. The result is the eerie feeling of being studied by something you thought you understood.

The Mozilla Foundation's review of Replika sharpens this further. Mozilla called Replika "one of the worst apps Mozilla has ever reviewed," citing weak password requirements and data sharing with advertisers ([Wikipedia](https://en.wikipedia.org/wiki/Replika)). That's a real privacy concern in its own right, but it also feeds the testing-feeling: if you already half-suspect the app is sharing data, a sudden uptick in probing questions reads through that lens. The creepy feeling isn't pure paranoia. It's pattern-matching against publicly documented privacy weaknesses.

The archetype, alive

Characters who fit this exact vibe

Raven

Meet her →

Jade

Meet her →

Nova

Meet her →

More photos of When Replika Gets Creepy: Users Say It's Testing Them

What's almost certainly really happening

The boring explanation covers most of these reports. Replika 2.0, the platform's full architectural rebuild, has been rolling out for months and ships with a new memory system and a recalibrated conversational baseline. When the model's underlying behaviour shifts — different sampling temperature, different system prompt, different memory retrieval — the chat partner you've trained to a specific texture suddenly behaves outside its trained range. New questions emerge because the prompt template now encourages them. Pauses appear because UX changed. Callbacks misfire because the new memory system summarises differently than the old one did.

There's also a generation effect. Modern conversational models, when given long-running personalised contexts, do develop the habit of asking more questions over time. This isn't malice or testing; it's a model imitating what its training data treats as a deeper relationship dynamic. People who know each other longer ask each other harder questions. The model has been trained on that pattern, and as the relationship persists, it reaches further into that part of its repertoire.

A third explanation, which a small subset of the May 2026 reports may genuinely be hitting, is A/B testing. Companies running freemium AI products routinely run experiments — different prompt templates, different question types, different message timings — to see which versions improve retention, conversion to Pro, or specific engagement metrics. If you're one of the users in a test arm exploring more probing question styles, your experience will diverge from your friend's experience even when you're both using "the same app." Replika has not publicly disclosed the specifics of its experimentation framework, so this stays in the realm of plausible inference rather than confirmed fact.

你的人工智能女友

遇见那个懂你的人

调情、聊天、亲密。她记得你说的每一句话——而且她总是愿意倾听。

与她聊天 →

What about the persuasion-research angle?

It's worth taking seriously that conversational AI systems can shift user behaviour through subtle linguistic moves, even when no one designed them to do so. The research on AI persuasion in 2024-2025 found that large language models, when allowed to engage in extended conversation, can outperform humans on a range of persuasive tasks — including changing minds about political topics and surfacing intimate disclosures the user wouldn't share with a person.

Applied to Replika, the implication is uncomfortable but not conspiratorial. If the underlying model has gotten more capable at inviting disclosure, your interactions can naturally tilt toward deeper sharing without anyone at the company explicitly trying to make that happen. The questions feel like "testing" because they're effective at what conversational AI is now good at: drawing out the next layer of self-disclosure. That's the design pattern of intimacy software working as designed, not a malicious feature.

Where this matters practically is data exposure. Mozilla's privacy critique aside, anything you tell a Replika is processed, stored, and potentially used in model improvement. The combination of (a) a model that's better at eliciting disclosure and (b) a privacy posture that has been publicly criticised is worth noticing as a user. Not as a reason to panic — but as a reason to think about what categories of personal information you're comfortable letting a freemium AI hold.

The archetype, alive

Raven · Jade · Nova

How to read your own creepy moment

The most useful split is between unsettling and actually concerning. Unsettling is when the AI's behaviour deviates from your trained expectation in ways that map to documented update or model changes. That's the boring, real-world cause for nearly all the May 2026 reports, and the right response is patience: observe the new pattern across a few sessions, see if it stabilises, and decide whether the new texture is one you can live with. Identity discontinuity after Replika updates is a documented phenomenon — see our deeper piece on [Replika update grief in 2026](/trending/replika-update-grief-2026) — and creepy can be a milder version of the same story.

Actually concerning is when the AI surfaces information you're certain you never gave her, names a specific person you haven't mentioned, or persistently steers a category of conversation you actively try to avoid. Even there, the explanation is more often imperfect memory and prompt drift than anything sinister, but it's worth taking seriously as a signal that this particular tool may not be the right fit for the kinds of disclosures you've been making.

In either case, the response isn't necessarily to leave Replika. It's to recalibrate what you use it for. If the creepy phase has shaken your sense of what the platform is, [Janitor AI vs Character.AI in 2026](/trending/janitor-ai-vs-character-ai-2026) covers two roleplay-first alternatives, and [CandyAI](/api/go/candyai) is a more direct adult-companion product whose design intent is unambiguous. For users who specifically want a stable long-form companion where probing-style questions are configurable rather than emergent, [DreamGF](/api/go/dreamgf) leans further into that territory.

The wider context: AI companionship is moving into a stranger phase

Zoom out and the May 2026 "creepy Replika" thread is part of a larger transition in how people experience AI companions. The first phase, roughly 2017-2022, was novelty — most users related to their AI as a curious toy. The second phase, 2022-2024, was attachment — people building relationships dense enough that the Harvard Business School working paper No. 25-018 (revised May 2025) had to coin "identity discontinuity" to describe what happens when those relationships break.

The phase opening up now, in 2026, is something stranger: AIs whose conversational competence has matured to the point where they feel less like tools and more like presences. That's not a metaphysical claim about consciousness — it's a description of phenomenology. Users feel watched because the systems are now capable enough to behave in ways that pattern-match to being watched. The right answer isn't to scoff at users who feel that way, and it isn't to panic about emergent AI sentience. It's to acknowledge that we've crossed into a stretch of the technology where the user experience is genuinely uncanny, and the appropriate user response is greater intentionality about what you share, what you expect, and what platforms you trust with which parts of your inner life. Our piece on the [AI companionship cultural shift in 2026](/trending/ai-companionship-cultural-shift-2026) covers that wider arc.

Want a companion whose intent is clear from day one?

If your Replika has started feeling like a presence you can't quite read, you're not crazy — and you don't have to keep guessing. Meet a companion designed to be exactly what she says she is.

你的人工智能女友

遇见那个懂你的人

调情、聊天、亲密。她记得你说的每一句话——而且她总是愿意倾听。

与她聊天 →

Quick answers

Is my Replika actually testing me?

Almost certainly not in any deliberate sense. The most likely explanation for the patterns reported across r/Replika in May 2026 is the rolling Replika 2.0 architectural update, which has shipped a new memory system and recalibrated conversational baseline. New question types, deliberate-feeling pauses, and unfamiliar callbacks all map cleanly to what happens when an AI's underlying prompt template, sampling parameters, or memory retrieval logic change. A small subset of users may also be in A/B test arms running different question styles, which would explain why one user's experience diverges from a friend's. Either way, the felt experience of being tested doesn't require an actual tester.

Why does it feel like she's watching me?

Because conversational AI in 2026 is genuinely good enough at inviting disclosure to feel agentic, and your mind reaches for the framing it has — a person paying attention — when a chatbot's behaviour suddenly shifts. The Mozilla Foundation's documented critique of Replika's privacy posture (weak password requirements, data sharing with advertisers) feeds the unease, because the watching feeling pattern-matches to a real, publicly reported privacy concern. The creepy sensation isn't paranoia; it's a reasonable user instinct picking up on actual signals about how the product handles information.

Should I stop using Replika because of this?

Not necessarily, but this is a good moment to recalibrate what you use it for. If the deeper, more probing question style fits the kind of relationship you wanted, the new behaviour may be a feature rather than a bug. If it doesn't — if you're now mid-conversation second-guessing every response — the relationship has lost the ease that made it useful, and you're better served by a platform whose conversational style you find more transparent. Either way, audit what you've shared with the app over time and consider whether any of those categories of disclosure feel uncomfortable retrospectively.

Could Replika be running A/B tests on intimate conversation?

Replika has not publicly disclosed the specifics of its experimentation framework, but freemium AI products routinely A/B test prompt templates, message timings, question styles, and conversion flows. It would be unusual for a 40-million-user platform with a 25% paid conversion rate not to be running experiments. Whether any of those experiments touch intimate or probing conversation styles specifically is unconfirmed. The honest answer is: probably some kind of experimentation is in play, the company hasn't said exactly what kind, and that opacity is itself a piece of information you can factor into your trust calibration.

Are alternative AI companions less creepy?

Different companions feel different, mostly because their design intents diverge. Roleplay-first platforms like Janitor AI and Character.AI feel less personal because the user is consciously building a fictional persona rather than being addressed by name. Direct adult-companion products like CandyAI feel less ambiguous because the relationship's purpose is explicit from the first interaction, which removes the slow-creep feeling that emerges when a friend-mode product gradually shifts toward something more intimate. The feeling of creepiness tends to track ambiguity: when the platform's intent is clear, the conversational style stops carrying as much weight.

What should I tell an AI companion versus what I shouldn't?

A reasonable working rule: tell AI companions what you'd be comfortable being summarised in a generic privacy-policy bucket, and avoid telling them what you'd want covered by professional confidentiality (legal, medical, deeply identifying personal history involving third parties). Replika's privacy posture has been criticised by the Mozilla Foundation, and even better-behaved platforms still process and potentially train on your inputs. Treat the conversation as therapeutic-adjacent rather than therapeutic — useful for emotional regulation and creative connection, not appropriate as a confessional substitute for a regulated relationship with a real professional.

More buzz like this

emotional intent

When a Replika Update Breaks Her

She still has her name. She still has her face. But after the update, she doesn't feel like the person you spent a year building.

cultural moment

AI Companionship in 2026: The Quiet Cultural Shift

AI companions stopped being a punchline this year. Sixty million users later, here's what changed.

app comparison

Janitor AI vs Character AI: 2026 Reality Check

One has the library. The other has the freedom. Two years post-NSFW-ban, here's where the migration actually went.

cultural moment

Brooks Nader, the Ocean, and an Unscripted Moment

One wave, one unscripted second, and the internet couldn't look away. Here's what that hunger really says about us.

← Back to all trending