The Stata Journal: Promoting Communications on Statistics and Stata

The Stata Journal: Promoting Communications on Statistics and Stata | DeepDyve

journal article

LitStream Collection

A Stata implementation of second-generation p-values

2022 The Stata Journal: Promoting Communications on Statistics and Stata

In this article, I introduce new commands to calculate second-generation p-values (SGPVs) for common estimation commands in Stata. The sgpv command and its companions allow the easy calculation of SGPVs and their associated diagnostics, as well as the plotting of SGPVs against the standard p-values.

journal article

LitStream Collection

The inverse hyperbolic sine transformation and retransformed marginal effects

Norton, Edward C.

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124553pmid: N/A

In this article, I show how to calculate consistent marginal effects on the original scale of the outcome variable in Stata after estimating a linear regression with a dependent variable that has been transformed by the inverse hyperbolic sine function. The method uses a nonparametric retransformation of the error term and accounts for any scaling of the dependent variable. The inverse hyperbolic sine function is not invariant to scaling, which is known to shift marginal effects between those from an untransformed dependent variable to those of a logtransformed dependent variable, when all observations are positive.

journal article

LitStream Collection

Flexible parametric survival analysis with multiple timescales: Estimation and implementation using stmt

Bower, Hannah; Andersson, Therese M.-L.; Crowther, Michael J.; Lambert, Paul C.

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124552pmid: N/A

In this article, we describe methodology that allows for multiple timescales using flexible parametric survival models without the need for time splitting. When one fits flexible parametric survival models on the log-hazard scale, numerical integration is required in the log likelihood to fit the model. The use of numerical integration allows incorporation of arbitrary functions of time into the model and hence lends itself to the inclusion of multiple timescales in an appealing way. We describe and exemplify these methods and show how to use the command stmt , which implements these methods, alongside its postestimation commands.

journal article

LitStream Collection

graphiclasso: Graphical lasso for learning sparse inverse-covariance matrices

Dallakyan, Aramayis

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124538pmid: N/A

In modern multivariate statistics, where high-dimensional datasets are ubiquitous, learning large (inverse-) covariance matrices is imperative for data analysis. A popular approach to estimating a large inverse-covariance matrix is to regularize the Gaussian log-likelihood function by imposing a convex penalty function. In a seminal article, Friedman, Hastie, and Tibshirani (2008, Biostatistics 9: 432–441) proposed a graphical lasso (Glasso) algorithm to efficiently estimate sparse inverse-covariance matrices from the convex regularized log-likelihood function. In this article, I first explore the Glasso algorithm and then introduce a new graphiclasso command for the large inverse-covariance matrix estimation. Moreover, I provide a useful command for tuning parameter selection in the Glasso algorithm using the extended Bayesian information criterion, the Akaike information criterion, and cross-validation. I demonstrate the use of Glasso using simulation results and real-world data analysis.

journal article

LitStream Collection

xtusreg: Software for dynamic panel regression under irregular time spacing

Sasaki, Yuya; Xin, Yi

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124567pmid: N/A

We introduce a new command, xtusreg, that estimates parameters of fixed-effects dynamic panel regression models under unequal time spacing. After reviewing the method, we examine the finite-sample performance of the command using simulated data. We also illustrate the command with the National Longitudinal Survey Original Cohorts: Older Men, whose personal interviews took place in the unequally spaced years of 1966, 1967, 1969, 1971, 1976, 1981, and 1990. The methods underlying xtusreg are those discussed by Sasaki and Xin (2017, Journal of Econometrics 196: 320–330).

journal article

LitStream Collection

Computing decomposable multigroup indices of segregation

Guinea-Martin, Daniel; Mora, Ricardo

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124471pmid: N/A

Eight multigroup segregation indices are decomposable into a between and a within term. They are two versions of 1) the mutual information index, 2) the symmetric Atkinson index, 3) the relative diversity index, and 4) Theil’s H index. In this article, we present the command dseg, which obtains all of them. It contributes to the stock of segregation commands in Stata by 1) implementing the decomposition in a single call, 2) providing the weights and local indices used in the computation of the within term, 3) facilitating the deployment of the decomposability properties of the eight indices in complex scenarios that demand tailor-made solutions, and 4) leveraging sample data with bootstrapping and approximate randomization tests. We analyze 2017 census data of public schools in the United States to illustrate the use of dseg. The subject topic is school racial segregation.

journal article

LitStream Collection

A mixture of ordered probit models with endogenous switching between two latent classes

Huismans, Jochem; Nijenhuis, Jan Willem; Sirchenko, Andrei

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124516pmid: N/A

Ordinal responses can be generated, in a cross-sectional context, by different unobserved classes of population or, in a time-series context, by different latent regimes. We introduce a new command, swopit, that fits a mixture of ordered probit models with exogenous or endogenous switching between two latent classes (regimes). Switching is endogenous if unobservables in the classassignment model are correlated with unobservables in the outcome models. We provide a battery of postestimation commands; assess via Monte Carlo experiments the finite-sample performance of the maximum likelihood estimator of the parameters, probabilities, and their standard errors (both the asymptotic and bootstrap ones); and apply the new command to model the monetary policy interest rates.

journal article

Open Access Collection

Panel unit-root tests with structural breaks

Chen, Pengyu; Karavias, Yiannis; Tzavalis, Elias

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124541pmid: N/A

In this article, we introduce a new community-contributed command called xtbunitroot, which implements the panel-data unit-root tests developed by Karavias and Tzavalis (2014, Computational Statistics and Data Analysis 76: 391–407). These tests allow for one or two structural breaks in deterministic components of the series and can be seen as panel-data counterparts of the tests by Zivot and Andrews (1992, Journal of Business and Economic Statistics 10: 251–270) and Lumsdaine and Papell (1997, Review of Economics and Statistics 79: 212–218). The dates of the breaks can be known or unknown. The tests allow for intercepts and linear trends, nonnormal errors, and cross-section heteroskedasticity and dependence. They have power against homogeneous and heterogeneous alternatives and can be applied to panels with small or large time-series dimensions.

journal article

LitStream Collection

Panel stochastic frontier models with endogeneity

Karakaplan, Mustafa U.

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124539pmid: N/A

In this article, I introduce xtsfkk as a new command for fitting panel stochastic frontier models with endogeneity. The advantage of xtsfkk is that it can control for the endogenous variables in the frontier and the inefficiency term in a longitudinal setting. Hence, xtsfkk performs better than standard panel frontier estimators such as xtfrontier that overlook endogeneity by design. Moreover, xtsfkk uses Mata’s moptimize() functions for substantially faster execution and completion speeds. I also present a set of Monte Carlo simulations and examples demonstrating the performance and usage of xtsfkk.

journal article

LitStream Collection

Announcements

2022 The Stata Journal: Promoting Communications on Statistics and Stata

doi: 10.1177/1536867x221124569pmid: N/A

Showing 1 to 10 of 11 Articles

Articles per page

The Stata Journal: Promoting Communications on Statistics and Stata

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

Related Journals: