Election Poll for a single state
In this and follwoing posts, I'll present a way to compute Bayesian prediction for the result of USA 2016 election based on election poll data and near-ignorance prior models. This model is described in detail here:A. Benavoli and M. Zaffalon. "Prior near ignorance for inferences in the k-parameter exponential family". Statistics , 49:1104-1140, 2014. (http://www.idsia.ch/~alessio/benavoli2014b.pdf)
In USA 2016 election, we have two main candidates Trump and Clinton. We denote with θt Trump's winning probability; with θc Clinton's winning probability and θu the undecided case (undecided voters, other candidates etc.). The goal of the inference is to estimate these parameters and in particular the focus is to compare the proportions of voters for Trump and Clinton, i.e., θt−θc.
Likelihood model
In the election poll, a total of n adults are polled to indicate their preference for the candidates Trump and Clinton. Let ˆynt denote the proportion of the sample that supports Trump, ˆync denote the proportion that supports Clinton and ˆynu=1−ˆynt−ˆync denote the proportion that is either undecided or vote for someone else. The counts nˆynt (number of votes for Trump), nˆync (number of votes for Clinton) and nˆynu (undecided) are assumed to have a multinomial distribution with sample size n and respectively parameters θt (Trump), θc (Clinton) and θu (undecided). Thus, the likelihood model is:p(data|θ)=θnˆynttθnˆyntcθnˆynuu,
where θt+θc+θu=1 are the unknown non-negative chances to be estimated.
Noninformative Prior model
In the standard Bayesian apparoach, we need to choose a prior on the unknonw θ parameters. A Dirichlet conjugate prior is a natural prior for θt, θc and θu:p(θ)∝θαt−1tθαc−1cθαu−1u,
where in the case of lack of prior information the prior parameters are commonly selected as follows:
Haldane's prior αt=αc=αu=0 (that is an improper prior); Jeffreys' prior αt=αc=αu=12; uniform prior αt=αc=αu=1.
The expected value of E[θt−θu] is equal to 0 and the prior probability
P(θt>θu)=0.5 for both Jeffreys and uniform priors (for Haldane's prior they are not defined).
See the paper for details about how computoing this lower and upper bounds.
These are commonly called noninformative priors, but they are not noninformative. These priors express indifference between Trump and Clinton, but not prior ignorance.
To see that, consider P[θt+0.5θu>θc+0.4θu], this is the probability that the proportion of votes of Trump exceeds the votes for Clinton assuming a ``swing'' scenario in which 50% of the undecideds vote for Trump and 40% of the undecideds for Clinton. This probability is equal to 0.76 in the case of the uniform prior and 0.66 in the case of Jeffreys' prior. It depends on the choice of the prior and this shows that the uniform and Jeffrey's priors are not really uninformative for this kind of poll.
Combining likelihood and prior, the resulting posterior is
p(θ|n,ˆyn)∝θnˆynt+αt−1tθnˆync+αc−1cθnˆynu+αu−1u,
which is always proper in the case of the Jeffreys' and uniform prior and in the case of Haldane's prior provided that ˆynt,ˆync,ˆynu>0.
The posterior expected value of θt−θc (Trump-Clinton difference) is: E[θt−θc|n,ˆyn]=nˆynt+αtn+αt+αc+αu−nˆync+αcn+αt+αc+αu,
while the posterior probability of the event θt−θc>0 is
P[θt>θc|n,ˆyn]=∫{θt>θc}θnˆynt+αt−1tθnˆync+αc−1cθnˆynu+αu−1u dθ∫θnˆynt+αt−1tθnˆync+αc−1cθnˆynu+αu−1u dθ,
which can be computed numerically by sampling from the Dirichlet distribution.
Near-ignorance Prior model
We now present a weaker prior model (near-ignorance model) that automatically allows us to take into account of possible swing scenarios.A near-ignorance is a set of priors, in particular we consider this set:
M={θℓt−1tθℓc−1cθ−ℓt−ℓc−1u, |ℓi|≤v, −ℓt+ℓc∈[−v,v]},
here v is parameter that represents pseudo-votes and it determines the strength of the prior inferences on the posterior inferences
When we have set of probabilty distributions, inference is obtained by computing lower and upper bounds of the expectations and probabilties of interest. From this model, a priori we have E_[θt−θc]=−1 (lower expectation), ¯E[θt−θc]=1 (upper expectation). Moreover, we have that P_(θt>θc)=0, ¯P(θt>θc)=1 and that P_[θt+0.5θu>θc+0.4θu]=0, and ¯P[θt+0.5θu>θc+0.4θu]=1.This is really a model of prior ignorance. Before seein the data all the probabilties of interest can assume any value from 0 and 1 that means we are not biasing our inference neither towards Trump not towards Clinton. This is a more correct expression of the lack of prior information on the election result.
The resulting set of posteriors is
Mp={p(θ|data)∝θnˆynt+ℓt−1tθnˆync+ℓc−1cθnˆynu+ℓu−1u, |ℓi|≤v, −ℓt+ℓc∈[−v,v]},
From this set we can extract posteriors that are useful for analysing possible Swing Scenario:
- v votes move from Trump to Clinton, the resulting posterior is
p(θ|n,ˆyn)∝θnˆynt−v−1tθnˆync+v−1cθnˆynu−1u
- v votes move from Clinton to Trump, the resulting posterior is
p(θ|n,ˆyn)∝θnˆynt+v−1tθnˆync−v−1cθnˆynu−1u
~
No comments:
Post a Comment