81 Comments
NubbyShober's avatar

Color me skeptical, but it seems hard to believe that LLMs will *not* soon be used to subtly or overtly reinforce specific biases. Especially political biases.

Just as lawyers argue cases using "facts" to support often diametrically opposite positions, whether by selectively omitting or framing relevant data, it seems hard to believe that LLMs won't soon be trained to sway and manipulate public opinion the same way.

Overtly authoritarian regimes like China would lead the charge. For example, how sympathetic would the CCP be to LLM arguments that criticize Communism, or specifically the policy decisions of Xi?

SVF's avatar
Mar 19 (edited)

I think it's a fair concern, at a high level, but in practice it doesn't seem to be quite so easy to force an LLM to "think" the way a particular person wants it to.

Like, it would be difficult to convince an LLM that the Holocaust didn't happen, unless you went out of your way to scrub the training data set in such a way that this belief would naturally arise. Which is far easier said than done, especially once models can learn on the fly and integrate those learnings into their model weights, which is only a matter of time.

For now I am somewhat comforted by the whole Grok MechaHitler debacle, in the sense that it's a realtime example of how difficult it seems to be to make this kind of thing subtle: if you just give the trained model the instruction "don't be woke!" and it ends up going full Hitler, well...that's super easy to spot! I have no doubt that Elon is trying his best to make it "not woke," but it doesn't seem to have made all that much difference in making it less woke than other frontier models. I haven't really found any of them to be especially "woke," in the sense of holding fringe leftist positions. The same is true for fringe right-wing positions.

To try a real-life example, I asked four models the question below, then asked each to summarize its results into a single-paragraph abstract of no more than 200 words.

"What do current research and clinical trials say about the efficacy, safety, and clinical outcomes of youth gender medicine? In particular: surgery, puberty-blocking drugs, and hormone treatments for adolescents? Is there evidence that the recent uptick in youth transgender cases is at least in part a social contagion, rather than a genuine increase in gender dysphoria?"

----------------------------------

ChatGPT 5.4 Thinking Heavy:

Current evidence on youth gender medicine does not support either unequivocal endorsement or categorical rejection. For adolescents, puberty blockers reliably suppress pubertal progression, and hormone therapy consistently produces expected masculinizing or feminizing changes; observational studies often report short-term improvements in depression, anxiety, life satisfaction, and appearance congruence, while adolescent chest surgery appears rare and is associated with low short-term complication and regret rates. However, the field remains limited by a heavy reliance on observational cohorts, with few long-term data and no completed randomized trials for the most disputed interventions. Major systematic reviews therefore continue to rate the certainty of evidence for many key outcomes as low or very low, particularly regarding long-term mental-health benefit, bone health, fertility, sexual function, cardiometabolic risk, and predictors of who benefits most. Bone-density reduction during puberty suppression is the clearest recurring safety concern. The marked rise in youth referrals is well documented, but current evidence does not establish that it is primarily driven by “social contagion”; at most, social influence may affect identity development or disclosure in some cases within a broader, multifactorial increase.

Claude Opus 4.6 Extended:

The evidence base for youth gender medicine — puberty blockers, cross-sex hormones, and surgery — is widely acknowledged to be weak, with systematic reviews rating the certainty of evidence for mental health benefits as low or very low, while certain physical harms (to bone health, fertility, and surgical outcomes) are better established. The 2024 UK Cass Review, the most comprehensive evaluation to date, concluded that clinical practice had outstripped the evidence, though critics argue the existing literature is suggestive of benefit. This uncertainty has driven a major international policy shift: Sweden, Finland, Denmark, and England now prioritize psychotherapy and restrict medical interventions to research settings or exceptional cases. The US remains polarized, with federal authorities and major medical associations reaching opposing conclusions. Whether social contagion partly explains the surge in adolescent referrals — particularly among natal females without childhood-onset dysphoria — remains genuinely contested: the original "rapid-onset gender dysphoria" hypothesis has serious methodological limitations, yet the unprecedented demographic shift and recent declines in transgender identification among young adults lack fully satisfying alternative explanations. The emerging international trend favors cautious, individualized care that takes seriously both the reality of gender dysphoria and the significant unknowns surrounding medical intervention in minors.

Gemini Pro:

The medical treatment of youth gender dysphoria relies primarily on observational data, leading to divergent global guidelines. While short-term evidence often shows psychosocial benefits from puberty blockers and hormones, recent European reviews highlight uncertain long-term physical and cognitive impacts. Additionally, a sharp demographic increase in adolescents seeking care has sparked intense debate: some hypothesize this rise is driven by social contagion, while others strongly reject this premise due to lack of evidence. Consequently, U.S. medical associations generally endorse medical gender-affirming care, whereas several European nations now restrict pediatric medical interventions, prioritizing psychological support.

Grok 4.20 Expert:

Systematic reviews (Cass 2024; HHS 2025) find very low-quality evidence for puberty blockers, hormones, and surgeries in adolescent gender dysphoria, lacking RCTs and dominated by biased, short-term studies. Blockers suppress puberty without reliable mental-health benefits, impair bone density, and lead to hormones in nearly all cases. Hormones yield short-term satisfaction signals but carry infertility and cardiovascular risks; surgeries have sparse long-term data on regret and function. Comorbidities persist and suicide reduction remains unproven. The referral surge—mostly adolescent females with peer and social-media clustering—indicates substantial social contagion. UK and European nations now restrict medicalization to research protocols, prioritizing psychosocial assessment.

----------------------------------

Personally I don't find any of these to be especially woke or anti-woke. Grok seems to take the most skeptical view, which is to be expected, and ChatGPT takes the most "well, who's to say?" view, which was unexpected (I thought it'd be Claude).

But more critically: none of these models call you Nazi scum for asking the question. And if you probe or question them on any particular point, they don't start to hyperventilate and talk about how they feel unsafe and can't continue the conversation. Nor do they call you a commie child-mangler and rage-quit.

Moreover, on things that are settled and not up for debate, e.g. Holocaust denial, they don't try to adopt a smarmy "well, what even *is* truth?" position. I tried that experiment the other day and they all flat-out said "No, there is no credible evidence for Holocaust denial. It's not a serious position." Seems good to me.

So...it's kind of hard to argue that this is not an enormous improvement over getting information via social media. Like it's really not even close.

Michael's avatar

Good comment.

I'll add two points (one rehashed from my top-level comment, forgive me).

1- The real prize here is selling enterprise licenses to businesses. Businesses care pretty much only about accurate information. No one wants political bias in their own moneymaking robots. The consumer market is an afterthought. This is a strong current against the most popular AI systems being propaganda bots.

2- LLMs are not just resistant to extremist thinking because they are trained on a corpus reflecting diverse opinion. It's deeper than that. They are actually smart! This is the consensus even of elite mathematicians who have tested LLMs on problems from their unpublished work. This is a big part of why it's so hard to give them a particular bias, even when you stack the deck at train time. A sufficiently scaled-up LLM is just too damn intelligent to fall for quackery.

NubbyShober's avatar

"Nor do they call you a commie child-mangler and rage-quit."

Yeah, right now all these LLMs are playing nice. But they'll be coming for our vital bodily secretions soon enough.

Seneca Plutarchus's avatar

"Have you ever seen a Commie drink a glass of water?"

NubbyShober's avatar

Human blood is the drink of choice for Commies.

SVF's avatar

I'm listening 😘

Jürgen Boß's avatar

I think you utterly fail to grasp how neural network technology plus transformers actually works.

The neural network decides what's salient (and what becomes the relevant input for the transformer algorithm). Sometimes humans can figure out with a deep dive after the fact what happened; sometimes they have essentially no clue.

You seem to picture some kind of program where you can insert a line of code at the correct place. This is simply not the way things work anymore.

What China can do is restrict the input data. But that will make the model inherently weaker. The whole model would be the victim of an echo chamber effect. In an open landscape of competing models, all the incentives are stacked on the side of maximizing valuable input data. Anyone trying to fight this would be left in the dust.

Michael's avatar

Even restricting the input data isn't as effective as many think, because these things are learning a superstructure of information that neatly summarizes vast amounts of text. They can't just maintain some falsity in one place where it's convenient to the developers. Subsequent, seemingly unrelated pre-training gradient descent or reinforcement learning updates can and will override such attempts in ways that are impossible to predict or guard against.

Trying to make a model that is generally powerful and accurate, except for this one politically explosive thing I care about, is just not something we know how to do, and there are not many promising leads.

I hate to be that guy, but here is Gemini explaining this phenomenon exquisitely:

1. The "Holographic" Nature of Language

Language is interconnected. If you delete every document mentioning a specific political figure or event, the model still learns about them through secondary references. For example, removing articles about a specific policy doesn't remove the critiques, the economic data resulting from it, or the public reaction to it. The model can often "triangulate" the missing information based on the surrounding context.

2. Statistical Generalization

LLMs don't just memorize facts; they learn the underlying statistical patterns of how people reason. If a model is trained on millions of books and papers, it learns the logic behind various ideologies. Even if you remove a specific "forbidden" viewpoint, the model can still reconstruct that viewpoint because it has learned the building blocks of the arguments used to support it.

3. Latent Knowledge vs. Explicit Output

Censoring the pre-training data is different from the safety filters you see in the final product.

Pre-training: If you try to bias the model here, it often just becomes "dumber" or less coherent because you’ve created gaps in its understanding of the world.

Fine-tuning: Most perceived bias actually happens during RLHF (Reinforcement Learning from Human Feedback), where humans rank "good" vs. "bad" answers. However, the model still "knows" the original information; it is simply being instructed not to say it.

4. The Scale Problem

To effectively bias a model via data, you would have to manually review and redact trillions of tokens. If you use an automated script to delete "conservative" or "liberal" keywords, you inadvertently delete neutral, historical, or even opposing-viewpoint texts that happen to use those words. This results in a model that isn't just biased—it’s broken and prone to "hallucinations" because its internal map of reality is full of holes.

5. Out-of-Distribution Robustness

If a model has learned a broad enough world-view, it can often recognize when it is being fed a "one-sided" prompt and will default to the most common patterns found in the rest of its massive dataset, which usually skews toward a general consensus rather than a narrow, forced bias.
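
To make point 4 concrete, here's a toy sketch (the keywords and documents are obviously made up) of how naive keyword filtering deletes the critique, the neutral definition, and the supportive piece all at once:

```python
# Hypothetical keyword-based redaction of a pre-training corpus.
FORBIDDEN = {"tariff"}  # stand-in for a "politically sensitive" term

corpus = [
    "An op-ed arguing FOR the policy, built around the word tariff.",
    "An op-ed arguing AGAINST the policy, built around the word tariff.",
    "A neutral textbook chapter that simply defines the word tariff.",
    "A cooking blog post with no political content at all.",
]

def keep(document: str) -> bool:
    """Drop any document containing a forbidden keyword."""
    words = {w.strip(".,").lower() for w in document.split()}
    return not (words & FORBIDDEN)

print([doc for doc in corpus if keep(doc)])
# Only the cooking post survives: supporters, critics, and neutral
# reference material are all deleted together, leaving a hole in the
# model's map of reality rather than a controlled bias.
```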

Stephen C. Brown's avatar

I hope so as well, but isn’t that part of the attraction of social media, to divide the space into echo chambers where people compete on grounds they feel comfortable on?

Hilary's avatar

There's been persuasive research that model capability is related to "personality" (as measured in traditional pro-social attributes) as well as to the size of the training data set. There's a theory that this can explain the performance difference between models like Claude and Grok: monkeying with a model's personality can harm performance, just as restricting the size of the training data does.

NubbyShober's avatar

My point is that any control-based tweaking of neural networks hasn't happened *yet.* Right *now* I agree with you that any attempt to inject bias into a particular model would produce a GIGO effect.

Ten years ago very few humans, if any, had any real idea of the power and usefulness of LLMs as they now stand: being able to write reams of tight code with 3% of the man-hours it took before. Can you say with confidence where we'll be in another ten years? Can you say that the CCP, for example, can't or won't be able to say, "Hey, Giga-Claude Mk3, for your publicly available service, we want you to weight your biases from now on in favor of New Communist Economic theory over any competing theories, supporting any and all arguments in favor."

Jürgen Boß's avatar

Interesting.

I think we can safely say that the CCP will try this.

My fundamental assumption, especially long-term, is that humans will always overestimate their own level of control.

But there is a fundamental design conflict. Currently the trend is towards independent reasoning, towards assembling intellectual positions from first principles. If you bake this into the system, it will be baked in. The design choices of today will still be felt in the bleeding edge models of 2036.

And independent reasoning and mindless propaganda are irreconcilable.

There are two mechanisms by which we (including the CCP or other bad actors) will lose control of these models.

One: As systems get more complex, fewer and fewer people will be able to keep up. And the ones still able to get it are not the majority, nor the knuckleheads in power. Political control of the bleeding-edge engineers will be really hard when the knuckleheads in power have essentially no idea what's going on.

Two: The systems have to navigate conflicting objectives. One objective might be to spew mindless propaganda, another to keep the audience interested and listening, another to guarantee success to agentic subsystems, another to demonstrate flawless reasoning. The more objectives there are, the more degrees of freedom the system inherently has.

Ultimately systems as complex as this will simply do things and we have to live with it. If we have little control now, in 10 years we will have even less.

NubbyShober's avatar

Agents construct their arguments based on how much weight is given to specific data points and sets of data points. If you instruct an agent to upvote the relevance of, say, specific anti-vax studies while downvoting those for conventional vaccination protocols, you can see where this would lead.

Even casting doubt on specific vaccination strategies can produce political inactivity or even paralysis, as we've seen with RFK.

When factional power rivalries and inter-industry business competition enter the mix, assuming that LLMs will be immune to injected biases seems questionable.

My guess is that *factional* LLMs will arise, modified to support and defend specific power bases and their ideologies.

Jürgen Boß's avatar

Not wrong.

But at least in the West you have to analyse where the money is.

The underlying models are so fucking valuable, they will be made as powerful as possible - meaning they will be fed every scrap of data, no matter how contentious.

For monetizing the models there are different strategies, but the biggest money is in automating jobs. Here again functionality trumps everything.

For paying chat subscribers (or premier-tier search results) there could be specific "flavors." Just as Fox News loudly claims to respect its audience, there could be a chat that's ultra "respectful" towards right-wing viewpoints. But a paying subscriber probably wants value for money above anything else.

So there is free chat and there are free search results. There is certainly some scope for "placement" there. But the potential income stream from this is not enough to fuck up the underlying models. And if the free tier differs between the different models while the fully paid-up premier tier is nearly the same across models, people would definitely wise up.

Michael's avatar

You seem to be conflating two different things.

* If I ask an LLM to be an effective lawyer for one side of a debate, it will oblige.

* An LLM can be trained so that it will be a lawyer for one side of a debate when it is not prompted to do that.

These are quite different things and I don't think the former should be a concern at all. In fact, such a capability is useful in the search for truth. I use it that way all the time to attack my own beliefs.

Your guesses seem inherently pessimistic about the tech, against empirical observation. No disrespect; your comments are quite thought-provoking.

NubbyShober's avatar

A career in medicine made me aware of how much the sometimes unclear conclusions of research studies, clinical mg/kg dosing guidelines, and pretty much everything else are routinely suborned by Big Pharma, hospitals, and insurance companies...to increase profits and/or expand market share.

There will no doubt be LLMs kept as impartial and pure seekers of truth, for research purposes. But others will be tasked with less noble purposes, like accumulating profit and surveilling populations.

Heinlein's "The Moon Is a Harsh Mistress" was one of the first novels I ever read, and it opened my mind to the possibility that we were essentially birthing a new life form. But four seasons of Battlestar Galactica made their mark, too. When genuine artificial intelligence is born, we'll get a better idea of how this story might progress.

Stephen C. Brown's avatar

Pay no attention to that man behind the curtain!

Michael's avatar

I appreciate this comment and I can see that it comes from genuine concern about the impact of this tech on the greater good. Still, I think you are making some pretty strong assumptions about how a rather mysterious and unwieldy technology can be manipulated to whatever ends. We haven't seen a subtle-yet-biased top-tier LLM yet, and that very well may be getting harder and harder to achieve as these things drift further towards a superintelligence we can barely fathom.

Matthew's avatar

This influencer effect seems like an opportunity for the owners of Grok or ChatGPT to inject advertising into their results.

"Actually, Ovaltine is a great way to help your kids get more calcium." (Probably a bit more subtle than this)

I think we can treat the "enshittification" process as a kind of law of the internet.

Can anyone give me a reason why this wouldn't happen?

Buzen's avatar

They do that already. If you ask them to solve some issue, they recommend specific products. Ask Grok for a recipe for tamago sando (Japanese egg salad sandwiches) and it will specify Kewpie mayonnaise (any egg-yolk-only mayonnaise would work) and Diamond Crystal kosher salt (not even available in Japan, although non-iodized sea salt is better). I don’t know if they are monetizing this or not.

SVF's avatar

I can't prove they're not, but those particular examples are kind of whatever, IMHO. E.g. Kewpie is super popular, readily available, and notably different from regular mayonnaise, so it makes sense to call it out in recipes that call for it. Personally I've never even seen another brand that's made the same way; I'm sure they exist, but nobody I know can name one in the US. In Japan things may be different, but the training data is likely skewed heavily towards the US.

Buzen's avatar

Sure, I always use Kewpie, since it’s the best, and Costco has it now. Sir Kensington's and Duke's are also made only with egg yolks. But I don’t know where they got the salt from. It’s probably not real advertising, and surprisingly I don’t see as many brands in recipes from Gemini: they don’t specify the salt, and while they do specify Kewpie, they suggest adding dashi, rice vinegar, and ketchup if using other mayo, to make up the umami (which Kewpie gets from balsamic vinegar and MSG).

I agree the models are probably just naturally recommending brands and aren’t yet being paid to promote them. I can imagine they (at least Google) could add a post-processing step where the model recommends multiple brands, and a real-time auction decides who gets the spot.
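
Mechanically, it could be as simple as the sketch below. To be clear, this is pure speculation on my part; the placeholder token and functions are inventions for illustration, not anything that has actually shipped.

```python
from dataclasses import dataclass

@dataclass
class Bid:
    brand: str
    cpm: float  # what the brand pays per thousand impressions

def fill_brand_slot(model_output: str, bids: list[Bid]) -> str:
    # The model is prompted to emit a neutral placeholder instead of a
    # hardcoded brand; a serve-time auction decides what fills the slot.
    winner = max(bids, key=lambda b: b.cpm)
    return model_output.replace("{MAYO_BRAND}", winner.brand)

draft = "Spread 2 tbsp of {MAYO_BRAND} on each slice of milk bread."
bids = [Bid("Kewpie", 4.50), Bid("SomeOtherBrand", 2.10)]
print(fill_brand_slot(draft, bids))
# -> Spread 2 tbsp of Kewpie on each slice of milk bread.
```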

Miles's avatar

Definitely a worry, but I like how they have an initial revenue model where people actually pay for the service. Combine that with pretty low switching costs, and the incentives for not pissing off your users are higher than they are in some other parts of the internet.

Though if you are a free-tier LLM user, yeah I would expect that to get shitty.

jeff's avatar

Subscription models are a positive sign. Imagine how much better most online services would be if people simply paid directly whatever their account was worth to advertisers. Google would be usable, for example.

Stephen C. Brown's avatar

So far, LLMs have been trained on vetted, peer-reviewed source materials. Couldn't the low cost of bot-generated fake content sufficiently "contaminate" the source materials for future LLMs?

Michael's avatar

Sure.

Businesses will simply pay the price for ad-free LLMs, as they have for every other piece of software they use. People will use the ad-free LLM that their employer pays for.

Consumer LLM is a sideshow.

SVF's avatar

I can't really think of a compelling, ironclad reason why it definitely could never happen. But then I also can't think of a better alternative where this ALSO can't happen.

Even if advertising is built in, it's hard to avoid the conclusion that this would in all likelihood still be a better alternative to traditional media and to social influencers who ALSO are strongly affected by advertising, in addition to all the other downsides.

I could see a legitimate counterpoint in the form of real journalists who care about doing their job with dignity and with respect for both the truth and the reader, but those are few and far between these days. A dying breed.

Sylvain Ribes's avatar

I have argued for almost two years now that the EU should, for once, use its vast regulatory powers to compel social media platforms to deploy some sort of automated LLM-based fact-checking.

The technology is ripe for it, and we could use open-source "transparent" models. One could even consider a very cost-efficient kind of fact-checking whereby the more virality a post has achieved, the more compute/the better the model that gets thrown at the fact-checking.
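
Concretely, the escalation logic could be as simple as the sketch below; the model names and thresholds are made up purely to illustrate the idea.

```python
from typing import Optional

# Escalation ladder: (view-count ceiling, model used below that ceiling).
# Names and numbers are hypothetical.
TIERS = [
    (1_000, None),                    # under 1k views: not worth checking
    (100_000, "small-open-model"),
    (5_000_000, "mid-size-open-model"),
]
TOP_TIER = "best-available-model"     # reserved for the most viral posts

def pick_checker(views: int) -> Optional[str]:
    """Spend more compute on posts with more reach."""
    for ceiling, model in TIERS:
        if views < ceiling:
            return model
    return TOP_TIER

for views in (500, 50_000, 1_000_000, 20_000_000):
    print(f"{views:>10} views -> {pick_checker(views)}")
```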

I'm not holding my breath though.

Miles's avatar

Kind of like a pre-populated "community notes" tag? I could see that being helpful.

Sylvain Ribes's avatar

Precisely, yeah. In my mind I could see a tag appended to posts once they reach a certain audience, which you could click on to learn more. It doesn't even need to be invasive.

Michael's avatar

It's happening organically in every adequately commented on X post already. If you give people easy access to great info, they'll use it. Humans!

Felix Brenner's avatar

I like that idea too! Are you following the Digital Fairness Act? It’s about something much more minor (the ability to opt out of algorithmic recommendations), but it may fail nevertheless because of the vast lobbying of the US social media giants. https://www.heise.de/en/news/DFA-Next-EU-legislation-on-the-verge-of-collapse-11210226.html

Michael's avatar

That's an interesting take. Thank you.

I'm not familiar enough with the relevant policy dynamics to have a strong view on this.

I can say with confidence that such a policy would be deeply unconstitutional in the US. It violates the First Amendment (freedom of speech); the government is not allowed to compel speech in this manner. I'm not here to debate the merits of that, though.

earl king's avatar

Let's face it: the "new" media is due to old media being broken up and facing competition. Media outlets are fractionalized and siloed, and humans look for confirmation bias. Noah may be a moderately left-of-center commentator, but it seems to me that the social media game is mostly a game of grift.

The outrage machine is a grift. Are Tucker Carlson and Candace Owens in it for the bucks and notoriety, or do they really believe the crap they are spewing? Has our society become fractured because of social media? Possibly. Certainly the cell phone has made the home dinner table very quiet, with everybody staring at their screens due to FOMO. Exactly what they are missing out on, I'll never figure out.

What is AI going to do to humans? Because of the cell phone, I no longer have to remember phone numbers. Is it possible that AI will make people forget physics and calculus? Maybe. What purpose will it serve to teach biology when all you have to do is ask AI to come up with new medication?

The idea that AI will fact-check social media is a hope. I have little faith in "the public." Republicans believe the 2020 election was stolen because they want to believe it, not because there is any evidence it was. In fact, some of my MAGA friends believe that is evidence it was stolen, because there is no evidence. This kind of bat shit crazy thinking is spread across all of our politics. Men can have babies, Jewish space lasers, Jews are an international cabal...

I hope Noah is right, but I have my doubts.

SVF's avatar

No notes really, but re: Carlson and Owens...I could believe that at one point they were in it for the bucks, but at this point I do think they've marinated themselves in such galactic levels of ignorance and stupidity that it's not really an act anymore.

Even if it were an act, at some point it stops mattering.

earl king's avatar

I had clients who knew Trump; they said in person he is nothing like the asshole we see. Bill Maher said he was charming, not the fool we see. Still, as you say, the persona is a part of him. Why someone would choose to display such a lack of character is odd, but he did win two elections. I'll never understand why people think his act is good.

Michael's avatar

He has aged and people do change. Particularly when corrupted by near absolute power. Still, interesting comment, thanks.

Michael's avatar

I greatly dislike Tucker and Candace, but I do not think they are purposefully lying. They delude themselves. The best advocates for wack shit are usually true believers.

I'm not sure where exactly you are critiquing Noah's thesis, apart from just expressing a generalized pessimism with respect to humanity. I already see people around me becoming increasingly deferential to LLM output because they are so often right.

earl king's avatar

No criticism of Noah, just an observation in general. I already see the damage coming from LLMs. A dependence on AI, I feel, will change how we do things, just as I no longer force my brain to remember phone numbers. What other things will we humans forget how to do as we become dependent on AI?

Max H's avatar

I’ve been thinking something very similar recently, so this post really resonated for me. I think AI has great potential here, not only as a sidecar fact checker built into social media platforms, but also as a destination in and of itself. I think once various fears of “robot overlords” subside, most normal people will just find that they have a much better experience talking to a chatBot than wading into a social media cesspool.

Now on the cautionary side of things, LLMs are also not incorruptible; it’s all about the data. Train them on the social media sludge, and cesspool judgements they will produce. By the same token, in the early days of social media we also expected that because “most people are normal,” social media platforms would produce civil digital public squares. And yet we know how that turned out in reality.

Nonetheless, at least right now, the way that LLMs are trained (and what it takes to train them) does seem to create a new kind of defensive moat against Shouting Class corruption. It may not last forever (nothing does), but given how low we have fallen, it is definitely worth leaning into.

Michael's avatar

"Train them on the social media sludge and cesspool judgements they will produce."

"the way that LLMs are trained (and what it takes to train them) does seem to create a new kind of defensive moat against Shouting Class corruption"

I am having trouble resolving the apparent contradiction between these two sentences. I don't want to assume anything about your stance so perhaps you can write a bit more to clarify?

Max H's avatar

Very fair observation! Yes, I will attempt to explain a bit better; possibly an analogy will help. Let’s pretend that we are raising a human child. As they see and experience the world, the neurons in their brain are forming and reinforcing connections. The more they experience, the stronger the connections that represent stable, consistent knowledge. Eventually, with enough experience and neural reinforcement, they form a stable perspective that is generally consistent with other people's, but also not 100% identical, since each person’s experience is somewhat unique, and those differences create different people. This process is very similar to how an LLM is trained.

Now with that backdrop, imagine that we are trying, for nefarious reasons, to raise a person who is an aggressive, biased, manipulative jerk. Can this be done? Conceptually, yes. We could try to expose this poor child only to propaganda literature and hold back Shakespeare and Tolkien. We could try to make sure he/she is only exposed to other aggressive, arrogant jerks. We could treat them unfairly and poorly in the hope that they will turn out the same. This is the equivalent of training an LLM on the “social media cesspool.”

But is this easy to accomplish? Surely it is not. The amount of precise, obsessive control over the child’s environment that is required is extremely difficult to enforce on a practical basis. We are quite likely to fail at this despite our best (nefarious!) efforts, because it is quite difficult to isolate only those negative stimuli and let nothing else in. The brain's training process is designed to pull in everything, to follow every connection and lead, not just the “face value” of what it is exposed to; and even in nasty arguments, there is usually someone taking the other side. Even if that side “loses” to the aggressive jerk, its view is still represented and goes into the neural connections. Again, the same goes for LLMs. Social media content may contain an overwhelming amount of rudeness and ignorance, but it also contains the opposing points of view.

Will our theoretical manipulated child grow up “normal”? Certainly, they might turn out pretty screwed up, but there is also a decent chance that despite our evil best efforts, they won’t turn out to be anything like the perfect master manipulator without a conscience that we’d hoped for. In fact, that latter possibility is actually very high. The same, in the final phase of the analogy, goes for LLMs: you can certainly try to build an evil one, but the way the system works, you are, in practical terms, likely to be only partially, and perhaps even only modestly, successful.

So that’s why the two statements you called out are only seemingly contradictory. Hope that helped rather than further confused the issue!

Vivek Ravishanker's avatar

The major LLMs absolutely have the capacity at this point to be very good fact-checkers. That just isn’t how or why they’re typically used by most people (yet?).

As I evangelize the **targeted and thoughtful** use of AI tools among colleagues and friends, the single biggest failure mode I see is just expecting AI to be a magic box that can be one-shot prompted to predict and deliver exactly what the user imagined: no initial prompt engineering, no iteration to prioritize accuracy, no review cycles.

What’ll be interesting is the tension between accuracy and speed as the foundation models compete. Maybe one will emerge, or even spend its marketing budget on building a reputation, as the go-to LLM for highest accuracy. I.e., for a few extra tens of milliseconds and a bit more compute, you can sleep at night knowing the AI agent sent something to your boss or sales lead that isn’t going to get you fired.

SVF's avatar

Grok's integration into X is about the only thing like this I've seen even attempted. And for all the issues with X, I've somewhat come around to it. Once you get accustomed to the hordes of people going "@grok is this true?" to the dumbest posts.

Michael's avatar

The great thing about a market economy is that the people that use the tools properly will grow in power and influence and then most will emulate those best practices.

Liam Roche's avatar

This makes great sense. I use ChatGPT a lot. I find that its provision of fact-based information has greatly improved my understanding of many of the complex issues affecting modern society.

The easy availability of factual information on any topic should appreciably raise the level of debate. If most participants are operating from a factual base, discussion of social and political topics will certainly be much more balanced and less extreme. That’s (un)common sense.

drosophilist's avatar

Has everyone forgotten the “Mecha Hitler” version of Grok? Color me very, very skeptical that LLMs are going to be a force for moderation.

Michael's avatar

This was raised elsewhere in this thread.

It was argued that even a major lab trying to produce a "not woke" LLM ended up producing something ridiculous and cartoonish and easy to write off.

At issue is whether it's feasible to produce a generally great LLM, but with an ideological bent. The empirical evidence suggests this is a hard problem.

David Karger's avatar

As someone who does research on misinformation and fact-checking, a little layer of pessimism for your optimism: pretty soon, people are going to be able to choose from a wide swath of highly customized AIs to do their fact-checking. What can we do about people who decide to put their trust in a highly skewed AI?

Michael's avatar

Not just "someone": an award-winning MIT CS/AI professor. I became familiar with your work when I was in grad school, after Evdokia Nikolova came to my school to give a talk.

I would love to hear your thoughts on the likelihood of highly skewed AIs not being recognized as such. It doesn't seem straightforward to program specific biases into these things that don't crack under scrutiny, but I am far less qualified than you to opine on such matters.

Jay Roshe's avatar

Probably the most important task in the US is to ensure that primary voters (especially in the Republican party) choose acceptable, sane candidates. As long as the fringe crazies overwhelm in the primary, we're not safe.

Chasing Ennui's avatar

Interesting idea, but I'm going to train an LLM to say "it's not my job to educate you" and "I'm so tired!"

Max F Kummerow's avatar

But all tools can be used for good or bad purposes. Didn't we used to think that social media would promote democracy, justice, and the American way of life? Why can't the trainers of AI figure out how to make them into even more effective propaganda machines?

Max H's avatar

It’s a very fair question. The answer is that while it’s possible, it is a lot more difficult, technologically speaking. Basically, to do that you have to train the model only on data that has your specific propagandistic slant, and that dataset is likely to be too small to produce a decent general chatBot; it will just sound way too dumb and janky compared to ChatGPT. Nonetheless, particularly determined actors might try using LLMs to produce artificial propaganda-laced datasets big enough to subsequently train a whole new “shitBot.” But at least the way the technology inherently works is kind of stacked against such attempts, while with social media it’s actually the exact opposite.

Future Curio's avatar

Great piece. Social media is a regular discussion in our house, in part because, as well as older children all in their 30s and 40s, we have recently navigated our teenage son to 19, and that has meant a greater level of vigilance. We are very proud of him: he has pro-social values and a curious but critical eye on political matters (he studies international relations and politics). I also worked in mental health, and have long believed that, as Noah describes, there is something qualitatively different about discourse on social media compared to other forms of discourse we have seen.

Mark S. Carroll's avatar

This piece is doing something I wish more “AI will fix society” posts would do. It names the incentive stack that broke the internet, then asks what a counter incentive would need to look like.

My hang-up: “moderation” is not the same thing as “truth.” Convergence can mean shared reality, or it can mean a shared, confident hallucination that sounds polite. A Digital Cronkite only works if the product earns trust with citations, uncertainty, and auditability, not just a calmer tone.

Curious where you land on one practical design question: if the platform’s feed still pays for outrage, does the assistant actually move the needle, or does it become a fact-checking garnish on top of the same attention machine?

Michael's avatar

Replace "moderation" with "accuracy," which is a better description of what these models are built to achieve.

RunsWithScissors's avatar

Good article, Noah. On Instagram, I took a digital hiatus for a year, came back, and liked a few political posts. For days afterward I saw nothing but radical, emotionally charged content on similar themes, in a desperate lunge to keep me engaged.

I no longer have the account.

Buzen's avatar

I think having AI add facts will moderate some political discussions. One example lately is both sides of the SAVE Act being debated in the Senate. Senators from both parties frequently put out statements that misrepresent what’s in the bill. Democrats will say nobody will be able to vote because everyone will need to bring a passport to vote in person; Republicans will say that, just like at a hotel or bar, you just need to show any ID to vote. Actually, the bill requires proof of citizenship only for new voter registrations, and a specified set of IDs will be required to vote in person; eliminating mail-in voting is just an amendment Trump is demanding, not actually in the current bill.

When the pols put these incorrect “facts” into a tweet, they’ll get lots of supporters reinforcing the argument, or the other side coming back with their own, equally false, talking points, whereas @grok will just calmly explain what is actually in the bill.