Bayesian Inference under NR | RDP 2023-07: Identification and Inference under Narrative Restrictions

RDP 2023-07: Identification and Inference under Narrative Restrictions 5. Bayesian Inference under NR

Raffaella Giacomini, Toru Kitagawa and Matthew Read

October 2023

Download the Paper 1.10MB

This section presents approaches to conducting Bayesian inference in SVARs under NR. Section 5.1 discusses how to modify the standard Bayesian approach in AR18 to use the unconditional likelihood rather than the conditional likelihood. Section 5.2 explains how to conduct robust Bayesian inference under NR, which further addresses the issue of posterior sensitivity due to a flat likelihood.

5.1 Standard Bayesian inference

AR18 propose an algorithm for drawing from the uniform normal-inverse-Wishart posterior of $(ϕ, Q)$ given traditional sign restrictions and NR. This is the posterior induced by a normal-inverse-Wishart prior for $ϕ$ and a uniform prior for Q. The algorithm draws $ϕ$ from a normal-inverse-Wishart distribution and Q from a uniform distribution over $𝒪 (n)$ , and checks whether the restrictions are satisfied. If not, the joint draw is discarded and another draw is made. If the restrictions are satisfied, the ex ante probability that the NR are satisfied at the drawn parameter values is approximated via Monte Carlo simulation. Once sufficient draws are obtained satisfying the restrictions, the draws are resampled with replacement using as importance weights the inverse of the probability that the NR are satisfied.^[15]

This algorithm can be interpreted as drawing from the posterior based on the unconditional likelihood and then using importance sampling to transform into draws from the posterior based on the conditional likelihood. Drawing from the posterior based on the unconditional likelihood therefore simply requires omitting the importance-sampling step. Constructing the importance weights requires Monte Carlo integration, which can be computationally expensive, particularly when the NR constrain the structural shocks in multiple periods. Omitting the importance-sampling step can therefore ease computational burden.

The algorithm described above places more weight on values of $ϕ$ (relative to the notional normal-inverse-Wishart prior) that are more likely to satisfy the restrictions under the uniform distribution over $𝒪 (n)$ (i.e. values with ‘larger’ conditional identified sets). As discussed in Uhlig (2017), it may instead be preferable to use a prior that is conditionally uniform over the identified set for Q. To draw from the posterior of $(ϕ, Q)$ under the unconditional likelihood given a conditionally uniform prior for Q simply requires obtaining a fixed number of draws of Q at each draw of $ϕ$ .

5.2 Robust Bayesian inference

Standard Bayesian inference based on the unconditional likelihood (or based on the conditional likelihood under shock-sign restrictions) is potentially sensitive to the choice of conditional prior for Q given $ϕ$ , because the likelihood possesses flat regions. This section explains how to conduct robust Bayesian inference about a scalar-valued function of the structural parameters under NR and traditional sign restrictions. The approach can be viewed as performing global sensitivity analysis to assess whether posterior conclusions are robust to the choice of prior on the flat regions of the likelihood. We assume that the object of interest is an impulse response $η$ , but the discussion applies to any other scalar-valued function of the structural parameters.

Let $π_{ϕ}$ be a prior over the reduced-form parameters $ϕ \in Φ$ , where $Φ$ is the space of reduced-form parameters such that $𝒬 (ϕ | S)$ is non-empty. A joint prior for $(ϕ, Q) \in Φ \times 𝒪 (n)$ can be written as $π_{ϕ, Q} = π_{Q | ϕ} π_{ϕ}$ , where $π_{Q | ϕ}$ is supported only on $𝒬 (ϕ | S)$ . When there are only traditional identifying restrictions, $π_{Q | ϕ}$ is not updated by the data, because the likelihood is not a function of Q. Posterior inference may therefore be sensitive to the choice of conditional prior, even asymptotically. As discussed above, a similar issue arises under NR. The difference under NR is that $π_{Q | ϕ}$ is updated by the data through the truncation points of the unconditional likelihood. However, at each value of $ϕ$ , the unconditional likelihood is flat over the set of values of Q satisfying the NR. Consequently, the conditional posterior for $Q | ϕ, Y^{T}$ is proportional to the conditional prior for $Q | ϕ$ at each $ϕ$ whenever the conditional identified set for Q given $(ϕ, Y^{T})$ is non-empty.

Rather than specifying a single conditional prior for Q, the robust Bayesian approach of GK21 considers the set of all conditional priors for Q that are consistent with the identifying restrictions:

(35)

Π_{Q | ϕ} = {π_{Q | ϕ} : π_{Q | ϕ} (𝒬 (ϕ | S)) = 1}

Notice that we cannot impose the NR using a particular conditional prior due to the data-dependent mapping from $ϕ$ to Q induced by the NR. However, by considering all possible conditional priors that are consistent with the traditional identifying restrictions, we trace out all possible conditional posteriors for $Q | ϕ, Y^{T}$ that are consistent with the traditional identifying restrictions and the NR. This is because the NR truncate the unconditional likelihood and the traditional identifying restrictions truncate the prior for $Q | ϕ$ , so the posterior for $Q | ϕ, Y^{T}$ is supported only on values of Q that satisfy both sets of restrictions.

Given a particular prior for $(ϕ, Q)$ and using the unconditional likelihood, the posterior is

(36)

\begin{array}{l} π_{ϕ, Q | Y^{T}, D_{N} = 1} \propto p (Y^{T}, D_{N} = 1 | ϕ, Q) π_{Q | ϕ} π_{ϕ} \\ \propto f (Y^{T} | ϕ) D_{N} (ϕ, Q, Y^{T}) π_{ϕ} π_{Q | ϕ} \\ \propto π_{ϕ | Y^{T}} π_{Q | ϕ} D_{N} (ϕ, Q, Y^{T}) \end{array}

The final expression for the posterior makes it clear that any prior for $Q | ϕ$ that is consistent with the traditional identifying restrictions is in effect further truncated by the NR (through the likelihood) once the data are realised. Generating this posterior using every prior in the set of conditional priors yields a set of posteriors for $(ϕ, Q)$ :

(37)

Π_{ϕ, Q | Y^{T}, D_{N} = 1} = {π_{ϕ, Q | Y^{T}, D_{N} = 1} = π_{ϕ | Y^{T}} π_{Q | Y} D_{N} (ϕ, Q, Y^{T}) : π_{Q | ϕ} \in Π_{Q | ϕ}}

Marginalising each posterior in this set induces a set of posteriors for $η, Π_{η | Y^{T}, D_{N = 1}}$ . Associated with each of these posteriors are quantities such as the posterior mean, median and other quantiles. For example, as we consider each possible prior within $Π_{Q | ϕ}$ , we can trace out the set of all possible posterior means for $η$ . This will always be an interval, so we can summarise this ‘set of posterior means’ by its end points:

(38)

[\int_{Φ} ℓ (ϕ, Y^{T}) d π_{ϕ | Y^{T}}, \int_{Φ} u (ϕ, Y^{T}) d π_{ϕ | Y^{T}}]

where $ℓ (ϕ, Y^{T}) = \inf {η (ϕ, Q) : Q \in 𝒬 (ϕ | Y^{T}, N, S)}, u (ϕ, Y^{T})$ $= \sup {η (ϕ, Q) : Q \in 𝒬 (ϕ | Y^{T}, N, S)}$ and $𝒬 (ϕ | Y^{T}, N, S) = {θ (ϕ | S) \cap θ (ϕ | Y^{T}, N)}$ is the set of values of Q that are consistent with the traditional identifying restrictions and the NR (i.e. the conditional identified set). In contrast, in GK21 the set of posterior means is obtained by finding the infimum and supremum of $η (ϕ, Q)$ over $𝒬 (ϕ | S)$ and averaging these over $π_{ϕ | Y^{T}}$ . The important difference from GK21 is that the current set of posterior means depends on the data not only through the posterior for $ϕ$ but also through the conditional identified set generated by the NR. As a result, in contrast with GK21, we cannot interpret the set of posterior means (Equation (38)) as a consistent estimator for the identified set for $η$ (which is not well-defined, as we discussed above). Nevertheless, the set of posterior means still carries a robust Bayesian interpretation similar to GK21 in that it clarifies posterior results that are robust to the choice of prior on the non-updated part of the parameter space (i.e. on the flat regions of the likelihood).

As in GK21, we can also report a robust credible region with credibility level $α$ . This is the shortest interval estimate for $η$ such that the posterior probability put on the interval is greater than or equal to $α$ uniformly over the posteriors in $Π_{η | Y^{T}, D_{N = 1}}$ (see Proposition 1 of GK21). We can also report posterior lower and upper probabilities. These are the infimum and supremum, respectively, of the probability for a hypothesis over all posteriors in the set.

To numerically implement this robust Bayesian procedure, we extend the numerical algorithms in GK21 to handle NR. We approximate the bounds of the conditional identified set at each value of $ϕ$ using a simulation-based approach based on Algorithm 2 of GK21. See Appendix A for details.

Footnote

Based on the results in Arias et al (2018), AR18 argue that their algorithm draws from a normal-generalised-normal posterior for the SVAR's structural parameters (A₀,A₊) induced by a conjugate normal-generalised-normal prior, conditional on the restrictions. [15]