leave a comment

Science Takes a Double Hit in the Press, Maybe

In his latest New Yorker piece The Truth Wears Off, Jonah Lehrer directs our attention to the lack of repro­ducibility of results in sci­en­tific research. The problem is per­vasive, he says:

…now all sorts of well-​​established, mul­tiply con­firmed finding have started to look increas­ingly uncertain. It’s as if our facts were losing their truth: claims that have been enshrined in text­books are sud­denly unprovable. This phe­nomenon  doesn’t yet have an official name, but it’s occurring across a wide range of fields, from psy­chology to ecology. In the field of med­icine, the phe­nomenon seems extremely widespread…

The Decline Effect, as Lehrer calls it, refers to sci­en­tists’ inability to reproduce reported results. The problem isn’t simple: it’s not just that dif­ferent inves­ti­gators or teams come up with con­flicting infor­mation, or interpret the same raw data in dis­parate ways; over time, a single sci­entist may not be able to reproduce his or her own observations.

Lehrer begin his story with a target loaded with potential bias and con­flicts of interest – a 2007 meeting in Brussels of sci­en­tists, shrinks and pharma exec­u­tives con­tem­plating the dis­ap­pointing results in recent large clinical trials of block­buster antipsy­chotic drugs like Abilify, Seroquel and Zyprexa. Initial reports, mainly from the early 1990s, which sup­ported these drugs’ FDA approval and wide­spread use, turned out to present a too-​​positive story. Later studies indicate these agents are not as ter­rific as was adver­tised; new data call into question the drugs’ effec­tiveness and safety.

This is probably true, but it’s hardly sur­prising. It happens in oncology all the time — when drug com­panies support initial studies of new drugs with an intention to sell those, it’s some­times the case (and unfor­tu­nately fre­quent) that initial reports are more promising than what really happens after a decades’ worth of less careful (i.e. more open) selection of patients who take an FDA-​​approved med­ication. Once you include a broader group of patients in the analysis, whose doctors aren’t researchers whose salaries are sup­ported by the drug makers, the like­lihood of getting truthful reports of side effects and effec­tiveness shoots up.

So I don’t think Lehrer’s big-​​pharma example is a rea­sonable shot at the sci­en­tific method, per se. Rather, it’s a valid per­spective on problems that arise when drug com­panies sponsor what’s sup­posed to be objective, sci­en­tific research.

Lehrer moves on to what might be purer example of the decline effect. He tells the story of Pro­fessor Jonathan Schooler, a now-​​tenured pro­fessor who dis­covered in the 1980s that humans’ mem­ories are strengthened by the act of describing them. The work is cited often, Lehrer says.

…But while Schooler was pub­lishing these results in highly rep­utable journals, a secret worry gnawed at him: it was proving dif­ficult to replicate his earlier findings. ‘I’d often still see an effect, but the effect just wouldn’t be as strong.’

Next, Lehrer steps back in history. He relates the story of Joseph Banks Rhine, a psy­chol­ogist at Duke who in the early 1930s developed an interest in the pos­si­bility of extrasensory per­ception. (Yes, that would be ESP.) Rhine devised exper­i­ments to evaluate indi­viduals’ capacity to guess which symbol-​​bearing cards might be drawn from a deck, before they’re drawn. The initial findings were uncanny: “Rhine doc­u­mented these stunning results in his notebook and pre­pared several papers for pub­li­cation. But then, just as he began to believe in the pos­si­bility of extrasensory per­ception, the student lost his spooky talent…”

Schooler, plagued with self-​​doubt about his pub­lished findings on human memory, as Lehrer tells it, embarked on an “ironic” attempt to replicate Rhine’s work on ESP. In 2004, he set up exper­i­ments in which he flashed images and asked a subject to identify those; next he ran­domly selected some of those images for a second showing, to see if those were more likely to have been iden­tified in the first round.

“The craziness of the hypothesis was the point,” Lehrer says. “But he wasn’t testing extrasensory powers; he was testing the decline effect.” He continues:

‘At first, the data looked amazing, just as we’d expected,’ Schooler says. ‘I couldn’t believe the amount of pre­cog­nition we were finding. But then, as we kept on running sub­jects, the effect size’ – a standard sta­tis­tical measure – ‘kept on getting smaller and smaller.’ The sci­en­tists even­tually tested more than two thousand under­grad­uates …‘We found this strong para­normal effect, but it dis­ap­peared on us.’

OK, are we talking science, or X-​​Files? I find this par­ticular episode – both in its original, depression-​​era version and in Schooler’s 1990s remake – fas­ci­nating, even thought-​​provoking. But these don’t change my con­fi­dence in the sci­en­tific method one iota.

He moves on to con­sider a zool­ogist in Uppsala, Sweden, who pub­lished on sym­metry and barn swallows’ mating pref­er­ences, aes­thetics and genetics whose Nature–pub­lished the­ories on “fluc­tu­ating asym­metry” haven’t stood the test of time. After an initial blitz of con­fir­matory reports and curious, related findings, the observed results dimin­ished. Another sci­entist, said to have been very enthu­si­astic about the subject and who tried to reproduce them with studies of sym­metry in male horned beetles, couldn’t find an effect. The researcher laments:

‘But the worst part was that when I sub­mitted these null results I had dif­fi­culty getting them pub­lished. The journals only wanted con­firming data. It was too exciting an idea to disprove…’

Next, Lehrer advances toward a more general dis­cussion on bias in sci­en­tific pub­lishing. This can only partly explain the decline effect, he says. Intel­lectual fads and journal editors’ pref­er­ences for new and pos­itive results lead to imbalance in reporting. Pub­li­cation bias dis­torts the reporting of pos­itive clinical trials over neg­ative or incon­clusive results. No argument here –

Still, the problem goes deeper. Lehrer inter­views Richard Palmer, a biol­ogist in Alberta who’s used a sta­tis­tical method called a funnel plot to evaluate trends in pub­lished research findings. What happens, Palmer says, is that researchers are dis­posed (or vul­nerable?, ES) to selective reporting based on their uncon­scious per­cep­tions of truth and uneven enthu­siasm for par­ticular con­cepts. He gives an example:

…While acupuncture is widely accepted as a medical treatment in various Asian coun­tries, its use is much more con­tested in the west. These cul­tural dif­fer­ences have pro­foundly influ­enced the results of clinical trials. Between 1966 and 1995, there were forty-​​seven studies of acupuncture in China, Taiwan, and Japan, and ever single trial con­cluded that acupuncture was an effective treatment. During the same period, there were ninety-​​four clinical trials of acupuncture in the United States, Sweden, and the U.K., and only fifty-​​six percent of these studies found any ther­a­peutic benefits.

These dis­crepant reports support that sci­en­tists see data in ways that confirm their pre­con­ceived ideas. “Our beliefs are a form of blindness,” Lehrer writes. In Wired he quotes Paul Simon: “A man sees what he wants to see and dis­re­gards the rest.” The point is clear.

Nearing the end, Lehrer draws on and extends upon David Freedman’s November Atlantic feature, Lies, Damned Lies, and Medical Science, on the critical, out­standing oeuvre of John Ioan­nidis, a Stanford epi­demi­ol­ogist who elu­ci­dates false­hoods in pub­lished research.

Re-​​reading these two articles together, as I did this morning, can be dis­heart­ening. “Trust no one,” I recalled. Seems like many — and pos­sibly most — pub­lished research papers are untrue or at least exag­gerated and/​or mis­leading. But on further and closer review, maybe the evi­dence for per­vasive untruths is not so solid.

In sum, the Truth Wears Off, in last week’s Annals of Science, offers valuable ideas — the decline effect (new), the statistician’s funnel plot (not new, but needing attention) and pub­li­cation bias (tiresome, but def­i­nitely rel­evant). The ESP story is an obvious weak link in the author’s argument, as is the article’s emphasis and reliance, to some degree, on psy­cho­logical models and findings in rel­a­tively soft fields of research. Physics, genetics, mol­e­cular biology and ulti­mately most aspects of cancer med­icine, I know and hope — can be mea­sured, tested and reported objectively.

My approach to new infor­mation is always to keep in mind who are my sources, whether those are authors of an article I’m reading or a doctor who’s making a rec­om­men­dation about a pro­cedure for someone in my family, and the lim­i­ta­tions of my own expe­ri­ences. I’m skep­tical about new drugs and medical tools, but deter­mi­nately open-​​minded.

The problem is this: if we close our minds to all new findings, we’ll never learn any­thing. Nor will we ever get better. Some­times sci­en­tific reports are accurate, life-​​saving or even paradigm-​​shifting; if only we could know which those are -

“When the exper­i­ments are done, we still have to choose what to believe,” Lehrer concludes.

He’s right; I agree. Our choices, though, should be informed — through lit­eracy, mul­tiple sources of infor­mation, and common sense.

—–

Related Posts:

Leave a Reply

  

  

  


*