Title: | High-Dimensional Temporal Disaggregation |
---|---|
Description: | First - Generates (potentially high-dimensional) high-frequency and low-frequency series for simulation studies in temporal disaggregation; Second - a toolkit utilizing temporal disaggregation and benchmarking techniques with a low-dimensional matrix of indicator series previously proposed in Dagum and Cholette (2006, ISBN:978-0-387-35439-2) ; and Third - novel techniques proposed by Mosley, Gibberd and Eckley (2021) <arXiv:2108.05783> for disaggregating low-frequency series in the presence of high-dimensional indicator matrices. |
Authors: | Luke Mosley [aut, cre], Kaveh S. Nobari [aut]
|
Maintainer: | Luke Mosley <[email protected]> |
License: | GPL (>= 3) |
Version: | 2.0.0 |
Built: | 2025-01-31 05:24:15 UTC |
Source: | https://github.com/cran/TSdisaggregation |
Used in disaggregation.R to find estimates given the optimal rho parameter.
chowlin(Y, X, rho, aggMat, aggRatio, litterman = FALSE)
chowlin(Y, X, rho, aggMat, aggRatio, litterman = FALSE)
Y |
The low-frequency response series (n_l x 1 matrix). |
X |
The high-frequency indicator series (n x p matrix). |
rho |
The AR(1) residual parameter (strictly between -1 and 1). |
aggMat |
Aggregation matrix according to 'first', 'sum', 'average', 'last' (default is 'sum'). |
aggRatio |
Aggregation ratio e.g. 4 for annual-to-quarterly, 3 for quarterly-to-monthly (default is 4). |
litterman |
TRUE to use litterman vcov. FALSE for Chow-Lin vcov. Default is FALSE. |
y Estimated high-frequency response series (n x 1 matrix).
betaHat Estimated coefficient vector (p x 1 matrix).
u_l Estimated aggregate residual series (n_l x 1 matrix).
Chow GC, Lin A (1971). “Best linear unbiased interpolation, distribution, and extrapolation of time series by related series.” The review of Economics and Statistics, 372–375.
Used in disaggregation.R to find estimates of the optimal rho parameter.
chowlin_likelihood(Y, X, vcov)
chowlin_likelihood(Y, X, vcov)
Y |
The low-frequency response series (n_l x 1 matrix). |
X |
The aggregated high-frequency indicator series (n_l x p matrix). |
vcov |
Aggregated variance-covariance matrix of Chow-Lin or Litterman residuals. |
There are no references for Rd macro \insertAllCites
on this help page.
This function contains the traditional standard-dimensional temporal disaggregation methods proposed by Denton (1971), Dagum and Cholette (2006), Chow and Lin (1971), Fernandez (1981) and Litterman (1983), and the high-dimensional methods of Mosley et al. (2021).
disaggregate( Y, X = matrix(data = rep(1, times = nrow(Y)), nrow = nrow(Y)), aggMat = "sum", aggRatio = 4, method = "Chow-Lin", Denton = "first" )
disaggregate( Y, X = matrix(data = rep(1, times = nrow(Y)), nrow = nrow(Y)), aggMat = "sum", aggRatio = 4, method = "Chow-Lin", Denton = "first" )
Y |
The low-frequency response series (n_l x 1 matrix). |
X |
The high-frequency indicator series (n x p matrix). |
aggMat |
Aggregation matrix according to 'first', 'sum', 'average', 'last' (default is 'sum'). |
aggRatio |
Aggregation ratio e.g. 4 for annual-to-quarterly, 3 for quarterly-to-monthly (default is 4). |
method |
Disaggregation method using 'Denton', 'Denton-Cholette', 'Chow-Lin', 'Fernandez', 'Litterman', 'spTD' or 'adaptive-spTD' (default is 'Chow-Lin'). |
Denton |
Type of differencing for Denton method: 'absolute', 'first', 'second' and 'proportional' (default is 'first'). |
Takes in a n_l x 1 low-frequency series to be disaggregated Y and a n x p high-frequency matrix of p indicator series X. If n > n_l x aggRatio where aggRatio is the aggregation ration (e.g. aggRatio = 4 if annual-to-quarterly disagg or aggRatio = 3 if quarterly-to-monthly disagg) then extrapolation is done to extrapolate up to n.
y_Est Estimated high-frequency response series (n x 1 matrix).
beta_Est Estimated coefficient vector (p x 1 matrix).
rho_Est Estimated residual AR(1) autocorrelation parameter.
ul_Est Estimated aggregate residual series (n_l x 1 matrix).
Chow GC, Lin A (1971).
“Best linear unbiased interpolation, distribution, and extrapolation of time series by related series.”
The review of Economics and Statistics, 372–375.
Dagum EB, Cholette PA (2006).
Benchmarking, temporal distribution, and reconciliation methods for time series, volume 186.
Springer Science \& Business Media.
Denton FT (1971).
“Adjustment of monthly or quarterly series to annual totals: an approach based on quadratic minimization.”
Journal of the american statistical association, 66(333), 99–102.
Fernandez RB (1981).
“A methodological note on the estimation of time series.”
The Review of Economics and Statistics, 63(3), 471–476.
Litterman RB (1983).
“A random walk, Markov model for the distribution of time series.”
Journal of Business & Economic Statistics, 1(2), 169–173.
Mosley L, Eckley I, Gibberd A (2021).
“Sparse Temporal Disaggregation.”
arXiv preprint arXiv:2108.05783.
data = TempDisaggDGP(n_l=25,n=100,p=10,rho=0.5) X = data$X_Gen Y = data$Y_Gen fit_chowlin = disaggregate(Y=Y,X=X,method='Chow-Lin') y_hat = fit_chowlin$y_Est
data = TempDisaggDGP(n_l=25,n=100,p=10,rho=0.5) X = data$X_Gen Y = data$Y_Gen fit_chowlin = disaggregate(Y=Y,X=X,method='Chow-Lin') y_hat = fit_chowlin$y_Est
Used in disaggregation.R to find estimates given the optimal rho parameter.
sptd(Y, X, rho, aggMat, aggRatio, adaptive = FALSE)
sptd(Y, X, rho, aggMat, aggRatio, adaptive = FALSE)
Y |
The low-frequency response series (n_l x 1 matrix). |
X |
The high-frequency indicator series (n x p matrix). |
rho |
The AR(1) residual parameter (strictly between -1 and 1). |
aggMat |
Aggregation matrix according to 'first', 'sum', 'average', 'last' (default is 'sum'). |
aggRatio |
Aggregation ratio e.g. 4 for annual-to-quarterly, 3 for quarterly-to-monthly (default is 4). |
adaptive |
TRUE to use adaptive lasso penalty. FALSE for lasso penalty. Default is FALSE. |
y Estimated high-frequency response series (n x 1 matrix).
betaHat Estimated coefficient vector (p x 1 matrix).
u_l Estimated aggregate residual series (n_l x 1 matrix).
Mosley L, Eckley I, Gibberd A (2021). “Sparse Temporal Disaggregation.” arXiv preprint arXiv:2108.05783.
Used in disaggregation.R to find estimates of the optimal rho parameter.
sptd_BIC(Y, X, vcov)
sptd_BIC(Y, X, vcov)
Y |
The low-frequency response series (n_l x 1 matrix). |
X |
The aggregated high-frequency indicator series (n_l x p matrix). |
vcov |
Aggregated variance-covariance matrix of AR(1) residuals. |
There are no references for Rd macro \insertAllCites
on this help page.
This function generates the high-frequency response vector
, according to
, where
is an
matrix of indicator
series, and the
coefficient vector may be sparse. The low-frequency
vector
can be generated by pre-multiplying an aggregation matrix
matrix, such that the sum, the average, the last or the first value of
equates the
corresponding
observation. The parameter aggRatio is the specified aggregation ratio between the low and high frequency series, e.g. aggRatio = 4 for annual-to-quarterly
and aggRatio = 3 for quarterly-to-monthly. If
, then the last
columns of the aggregation matrix are 0 such that
is only observed up to
.
For a comprehensive review, see Dagum and Cholette (2006).
TempDisaggDGP( n_l, n, aggRatio = 4, p = 1, beta = 1, sparsity = 1, method = "Chow-Lin", aggMat = "sum", rho = 0, mean_X = 0, sd_X = 1, sd_e = 1, simul = FALSE, setSeed = 42 )
TempDisaggDGP( n_l, n, aggRatio = 4, p = 1, beta = 1, sparsity = 1, method = "Chow-Lin", aggMat = "sum", rho = 0, mean_X = 0, sd_X = 1, sd_e = 1, simul = FALSE, setSeed = 42 )
n_l |
Size of the low frequency series. |
n |
Size of the high frequency series. |
aggRatio |
aggregation ratio (default is 4) |
p |
The number of high-frequency indicator series to include. |
beta |
The positive and negative beta elements for the coefficient vector. |
sparsity |
Sparsity percentage of the coefficient vector. |
method |
DGP of residuals, either 'Denton', 'Denton-Cholette', 'Chow-Lin', 'Fernandez', 'Litterman'. |
aggMat |
Aggregation matrix according to 'first', 'sum', 'average', 'last'. |
rho |
The residual autocorrelation coefficient. Default is 0. |
mean_X |
Mean of the design matrix. Default is 0. |
sd_X |
Standard deviation of the design matrix. Default is 1. |
sd_e |
Standard deviation of the errors. Default is 1. |
simul |
When 'TRUE' the design matrix and the coefficient vector are fixed. |
setSeed |
The seed used when 'simul' is set to 'TRUE'. |
y_Gen Generated high-frequency response series.
Y_Gen Generated low-frequency response series.
X_Gen Generated high-frequency indicator series.
Beta_Gen Generated coefficient vector.
e_Gen Generated high-frequency residual series.
Dagum EB, Cholette PA (2006). Benchmarking, temporal distribution, and reconciliation methods for time series, volume 186. Springer Science \& Business Media.
data = TempDisaggDGP(n_l=25, n=100, aggRatio=4,p=10, rho=0.5) X = data$X_Gen Y = data$Y_Gen
data = TempDisaggDGP(n_l=25, n=100, aggRatio=4,p=10, rho=0.5) X = data$X_Gen Y = data$Y_Gen