Not that I know anything about the GOP debates or candidates, but I happened to see in a CNN post this nice visualization of verbal attacks during the RL GOP Debate, and I thought I would do a little social network analysis (SNA) and try to draw conclusions about the debate WITHOUT actually having seen it…

Let’s see how it goes and, please, if you’ve seen the debate and know better than I do, let me know if I am very wrong 🙂

I find the use of statistics in the justice system a thrilling subject, especially when you find out that some people, like Lucia de Berk, have been handed life sentences based solely on flawed statistics coming from experts like Mr. Henk Elffers. So in this post I’ll talk about what he did wrong and how to avoid this kind of huge boo-boo in our statistical lives.

The use of statistics in the justice system actually has a long history: the amazing mathematician / engineer / physicist / philosopher of science Henri Poincaré already had to correct the misuse of statistics in the infamous Dreyfus trial.

But it was in the Lucia de Berk trial where combining p-values wrongly handed her a life sentence. I won’t go into the details of the trial; for that there are many other places, like Mr. Richard D. Gill’s web page account of the trial and a video worth a look. Instead I will focus on how to appropriately deal with a bunch of p-values to make sense of our data.
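To get a feel for why simply multiplying p-values goes wrong, here is a minimal sketch of one standard way to combine independent p-values, Fisher's method. It is an illustration under stated assumptions, not necessarily the approach the full post takes: the function name `fisher_combined_pvalue` and the three example p-values are made up, and the method assumes the underlying tests are independent.

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent p-values with Fisher's method.

    Under the null hypothesis, X = -2 * sum(ln p_i) follows a chi-square
    distribution with 2k degrees of freedom, where k = len(pvalues).
    """
    k = len(pvalues)
    if k == 0:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    half_x = -sum(math.log(p) for p in pvalues)  # this is X / 2

    # For an even number of degrees of freedom (2k) the chi-square tail
    # has a closed form: P(X > x) = exp(-x/2) * sum_{i<k} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_x / i
        total += term
    return math.exp(-half_x) * total


# Hypothetical p-values, for illustration only.
pvals = [0.04, 0.20, 0.30]
naive_product = math.prod(pvals)          # NOT a valid p-value
combined = fisher_combined_pvalue(pvals)  # a valid combined p-value
print(f"naive product = {naive_product:.4f}, Fisher = {combined:.4f}")
```

With these made-up inputs the raw product is 0.0024, which looks damning, while the combined p-value comes out at roughly 0.06. The product of several p-values always shrinks as more tests are added, so it cannot be read as a p-value on its own.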

One would think that humanity would not have a need for good random number generators until computers and simulations were invented since, for most practical purposes, tossing a coin or throwing a die should suffice for us all. So you can imagine my surprise when I saw, in this four-to-five-thousand-year-old Chinese divination book called the I Ching, an RNG algorithm that resembles modern Linear Congruential Generators! But why the need for such a complex procedure to render random numbers?

The I Ching divination process requires randomly selecting two trigrams via a rather convoluted process using stems of either Artemisia or Yarrow. And although I acquired this ancestral book a long, long time ago, the truth is that when reading it as an oracle I always used the simplified version for lazy busy people, which consists of simply tossing three coins and checking the combination of heads and tails.

I always thought that the traditional form was just a magical way to do the same thing that we can do by tossing three coins, but today, for no particular reason other than having too much free time on my hands, I gave this traditional form a deeper mathematical look, and it turns out that it renders a completely different random result than tossing three coins!

Well, a mathematical curiosity you might think, but does it matter? It might! Millions of people seek advice using the simplified coin version to render the I Ching Yin Yang oracles. In this post I will show how the three coins method yields an equal proportion of Old Yin and Old Yang oracle signs, whereas the traditional method yields three times more Old Yang signs than Old Yin!
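A minimal sketch of that comparison, under two assumptions that are not spelled out above: the usual coin convention (heads = 3, tails = 2, with totals 6, 7, 8 and 9 read as Old Yin, Young Yang, Young Yin and Old Yang), and the commonly cited yarrow-stalk probabilities of 1/16, 5/16, 7/16 and 3/16, which are taken as given here rather than derived from the stalk-splitting procedure.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

LINE_NAMES = {6: "Old Yin", 7: "Young Yang", 8: "Young Yin", 9: "Old Yang"}


def three_coin_distribution():
    """Exact line probabilities for three fair coins (heads = 3, tails = 2)."""
    counts = Counter(sum(toss) for toss in product((2, 3), repeat=3))
    total = sum(counts.values())  # 8 equally likely outcomes
    return {value: Fraction(n, total) for value, n in sorted(counts.items())}


# Assumed, commonly cited probabilities for the traditional yarrow method.
YARROW = {
    6: Fraction(1, 16),
    7: Fraction(5, 16),
    8: Fraction(7, 16),
    9: Fraction(3, 16),
}

coins = three_coin_distribution()
for value, name in LINE_NAMES.items():
    print(f"{name:>10}: coins = {coins[value]}, yarrow = {YARROW[value]}")

# Ratio of Old Yang to Old Yin under each method.
print("coins :", coins[9] / coins[6])
print("yarrow:", YARROW[9] / YARROW[6])
```

Under these assumptions the coins give Old Yin and Old Yang the same weight, while the yarrow figures make Old Yang three times as likely as Old Yin, which matches the claim in the text. Both methods still split Yin and Yang evenly overall; only the balance among the "old" lines changes.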

This means that the I Ching, in its traditional form of drawing oracles, promotes Yang behaviour over Yin; that is, it promotes action, imagination, creativity and strength among its users whereas, nowadays, with the simplified three coin version, the active and passive answers are evened out.

I am not a sinologist or a psychologist, so I cannot really tell which version would have a better influence on practitioners’ lives. I do know, though, that the traditional form promotes Yang among those seeking advice, which at first glance seems like a positive thing to do and, since this book is used by millions of people, maybe experts in the field should advise practitioners not to use three coins anymore when using the I Ching. For those interested in having a traditionally sound oracle in terms of probability, I will show a few simple ways to achieve just that at the end of this post.

This book has impressed mathematicians like Leibniz, psychologists like Jung, poets like Jorge Luis Borges and all kinds of intellectuals all over the world for centuries. And regardless of whether you believe it has magical properties, what is certain is that it has deep psychological and sapiential ones. This is not only one of the oldest books in human history, but a beautiful one. So, before we plunge into the mathematical details of the traditional algorithm to draw oracles, let’s share this poem from Borges about the I Ching to break the ice.

For a Version of I Ching

Para una versión del I King

The future is as immutable
As rigid yesterday. There is nothing
That is no more than a single, silent letter
In the eternal and inscrutable
Writing whose book is time. He who walks away
From home has already come back. Our life
Is a future and well-traveled track.
Nothing dismisses us. Nothing leaves us.
Do not give up. The prison is dark,
Its fabric is made of incessant iron,
But in some corner of your cell
You might discover a mistake, a cleft.
The path is fatal as an arrow
But God is in the rifts, waiting.

El porvenir es tan irrevocable
Como el rígido ayer. No hay una cosa
Que no sea una letra silenciosa
De la eterna escritura indescifrable
Cuyo libro es el tiempo. Quien se aleja
De su casa ya ha vuelto. Nuestra vida
Es la senda futura y recorrida.
Nada nos dice adiós. Nada nos deja.
No te rindas. La ergástula es oscura,
La firme trama es de incesante hierro,
Pero en algún recodo de tu encierro
Puede haber un descuido, una hendidura,
El camino es fatal como la flecha
Pero en las grietas está Dios, que acecha.

To this day I have defined my theological position as Agnostic, which is not saying much given the different interpretations and philosophical flavors available to position ourselves when it comes to God. This is why I sometimes simply reply to The Question with something like “Both alternatives are equally crazy, so I don’t know.” But can we use statistics to better describe our position in these kinds of philosophical matters, or even to dictate how we should live our lives? Yes, we can.

WARNING: Beware agnostics!!! I will show mathematical arguments that might turn you into a full blown Believer or a hardcore Atheist… So if you keep reading don’t say I did not warn you.

If we envision probability as a measure linked to a random process, then questions like “What is the probability that God exists?” imply a sort of Supra-God that creates universes with Gods with a frequency p. But then some might argue that this Supra-God is actually God so, in the end, these kinds of philosophical questions make no statistical sense under such a frequentist interpretation of probability.

Then we have those who interpret probability as a degree of belief on matters subject to uncertainty; this is the interpretation held by Bayesian Statistics.

So if I wear a Bayesian hat and I am asked The Question then, instead of replying “I don’t know” to describe my ignorance, I should reply with “50%” or “p=1/2”. This is so because when Bayesians (The Objective Kind) have no information on a problem they use a plethora of principles, in a Groucho style fashion, to figure out a prior distribution to kick off the Bayes’ Theorem machinery.

But there are an infinite number of prior distributions with an expected value of 1/2 so, which of them best describes my agnosticism? Is there such a thing as a unique agnostic prior to rule them all? Well, it seems this Holy Grail does not exist, since we can read in highly commendable Bayesian books like Bernardo & Smith things like:

In general we feel that it is sensible to choose a non-informative prior which expresses ignorance relative to information which can be supplied by a particular experiment. If the experiment is changed, then the expression of relative ignorance can be expected to change correspondingly. (Box and Tiao, 1973 p.46).

Wait, what? We change the experiment and our prior ignorance changes too? In fact, not all Bayesians agree that such non-informative priors exist (Howson 2002; O’Hagan 2006; Press 2003); they regard any Bayesian Objective “non-informative” priors simply as well-formed beliefs… So I’ll go with the Subjective interpretation, and in this post I am going to well-form my belief in God.
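As a minimal sketch of the earlier point that infinitely many priors share an expected value of 1/2, consider the symmetric Beta(a, a) family. This family is an assumption chosen purely for illustration, not the prior this post goes on to build, and `beta_mean_var` is a hypothetical helper that just evaluates the textbook mean and variance of a Beta distribution.

```python
from fractions import Fraction


def beta_mean_var(a, b):
    """Exact mean and variance of a Beta(a, b) distribution."""
    a, b = Fraction(a), Fraction(b)
    mean = a / (a + b)
    var = a * b / ((a + b) ** 2 * (a + b + 1))
    return mean, var


# Every symmetric Beta(a, a) has mean 1/2, yet the spread differs a lot.
for a in (Fraction(1, 2), 1, 2, 10, 100):
    mean, var = beta_mean_var(a, a)
    print(f"Beta({a}, {a}): mean = {mean}, variance = {float(var):.5f}")
```

Each of these priors answers "50%" on average, but a small `a` piles the belief near 0 and 1 while a large `a` concentrates it tightly around 1/2, so the expected value alone does not pin down what kind of agnostic one is.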

Plus, in the process of cooking my Agnostic prior I’ll discuss why Bayesians should measure their beliefs from 0 to π instead of from 0 to 1. The latter measure is too frequentist for them, and π makes more mathematical sense since trigonometric functions are going to pop up naturally everywhere in our prior belief endeavor.

Human minds are the mother of all interesting things, since anything that we might consider interesting is so because our minds make us believe so. It seems reasonable, then, that all kinds of philosophical issues and scientific problems cannot be properly addressed unless we correctly understand how our minds work, but what do we know about how they work?

Cognitive Science offers many theories on how any mind might work, but when it comes to our minds there seems to be evidence, put forward by psychologists, that whatever the way they work, human minds do not abide by the laws of probability.

Several attempts have been made to explain these results, and one of the latest comes from the hand of Quantum Mechanics… No kidding.

So when I saw this valiant attempt from theoretical physicists to explain how the human mind works by using their almighty and powerful Quantum Hammer, I thought it was a good moment to explain an alternative solution that I myself worked out long, long ago, after being exposed to this problem by philosopher Paul Thagard in his excellent book MIND.

Also, Sister Hot is my assistant and I need her to prove my point, which is that our minds might abide by probability laws more than we think after all. If you want to know how she is going to assist me you need to keep reading; probability can be sexy 😉

A while ago I found a very interesting paper by Leah R. Jager and Jeffrey T. Leek, via a post in the Simply Statistics blog, arguing that most published medical research is true, with a rate of false positives among reported results of 14% ± 1%. Their paper came as a response to an essay from John P. A. Ioannidis and several other authors claiming that most published research findings are false.

After dealing with some criticisms, Mr. Leek made a good point in his post:

“I also hope that by introducing a new estimator of the science-wise fdr we inspire more methodological development and that philosophical criticisms won’t prevent people from looking at the data in new ways.”

And thus, following this advice, I didn’t let criticisms prevent me from looking at the data in a new way. So for this problem I have devised a probability distribution for p-values to then fit the data via MLE and infer from there the rate of false positives.

So this is my take: a 15.33% rate of false positives, with a worst-case scenario of 41.75% depending on how mischievous researchers are but, in any case, and contrary to what other authors claim, most medical research seems to be true.

At most 22 percent of Catholic priests in the USA are homosexuals.

Homosexual men in the USA, as a group, molest children at a rate at least 15 times higher than heterosexual men.

An asteroid brushes past Earth, a meteorite crashes in Russia and, a few days later, Pope Benedict XVI takes the cosmic message and resigns. Nonetheless, many people question the true reason for his resignation, alleging that it has nothing to do with fatigue but rather with homosexual networks within the Church (a CNN guest claimed that 50% of priests are homosexual) and unresolved pedophilia scandals. So I took a look at this percentage with our best friend when it comes to politically incorrect statistics, Bayes’ Theorem, and I got the results displayed above.

I know, I know, the numbers are pretty crazy, but they are based on data fetched from official sources and, before going into the details, let me play sociologist. Although homosexuals, as a group, molest children at higher rates than heterosexuals, it is very important to realize that this does not necessarily mean homosexuals are more prone to this behavior; assuming so might constitute an ecological fallacy. In this case it makes more sense that this outcome is due to the fact that young boys are far less protected by parents than young girls, and predators take advantage of this.

The Calculations

To estimate the rate of homosexuals among Catholic priests we will first estimate how much more likely male homosexuals are to engage in pederasty compared to male heterosexuals. Then we will use this result, together with the gender breakdown of children abused by Catholic priests (81 percent of the victims in the USA were male), to calculate the final figure.

Tests for linear trends were significant in the three ROIs:

dACC: F = 8.28, p < .014;

rACC: F = 17.97, p < .001;

amygdala: F = 30.02, p < .001

but not for higher order trends

The other study they mention does not involve any experiment and is merely a review of other studies.

Whether the statistical significance found in the study is significant for science is up to the researchers but, yeah, we can make inferences with a sample size of just 13. Interestingly, others would regard a sample with “too many” people as a manipulation to achieve significance. So I guess that when we don’t like something we can always find reasons to complain about it.

The last post in this Climategate series is dedicated to the climate of fear mongering we all see every now and then in the media, claiming extreme weather patterns linked to global warming in an end-of-the-world tone. I will offer some insights and calculations to show that the “extremists” might be wrong.

It seems that the poor climatologists in Australia had no choice but to reuse the purple color already in use for the negative range (-25, -18) ºC for the positive range (50, 54) ºC. What could these people possibly have done but mix cold and hot weather colors!? Well, here’s an idea:

Here goes a present for climatologists in Australia: 121 different, not-scary-oh-my-gaw-how-hot-it-is colors for the range (-60, 60) ºC, and just in case you need more I have a few spare millions. You’re welcome.
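For anyone who wants to cook such a palette at home, here is a minimal sketch of one way to do it. The scheme is an assumption made for illustration (a straight sweep of the HSV hue from blue at -60 ºC to red at 60 ºC, one color per whole degree), not necessarily the palette pictured in the original post, and `temperature_palette` is a hypothetical helper name.

```python
import colorsys


def temperature_palette(t_min=-60, t_max=60):
    """Map each whole-degree temperature in [t_min, t_max] to an RGB color."""
    span = t_max - t_min
    palette = {}
    for t in range(t_min, t_max + 1):
        frac = (t - t_min) / span          # 0.0 at the coldest, 1.0 at the hottest
        hue = (2.0 / 3.0) * (1.0 - frac)   # hue 2/3 is blue (cold), hue 0 is red (hot)
        r, g, b = colorsys.hsv_to_rgb(hue, 1.0, 1.0)
        palette[t] = (round(r * 255), round(g * 255), round(b * 255))
    return palette


palette = temperature_palette()
print(len(palette), "temperatures,", len(set(palette.values())), "distinct colors")
print("-60 ºC ->", palette[-60], "| 60 ºC ->", palette[60])
```

Cold values stay in the blues, hot values stay in the reds, and no color is used twice, so nobody has to recycle purple. The few spare millions come for free, since 24-bit RGB offers about 16.7 million colors.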