经典的面板数据集(R语言包plm)
R语言包plm内置的面板数据集
- 1 Cigarette Consumption
- 2 Crime in North Carolina
- 3 Gasoline Consumption
- 4 Grunfeld's Investment Data
- 5 Wages and Hours Worked
- 6 Wages and Education of Young Males
- 7 Purchasing Power Parity and other parity relationships
- 8 US States Production
- 9 Production of Rice in Indonesia
- 10 Employment and Wages in Spain
- 11 The Penn World Table, v. 5
- 12 Panel Data of Individual Wages
(注意:本博客关于数据的描述全部来自于R语言内置帮助文件,仅供自己学习使用)
1 Cigarette Consumption
data(Cigar)
Description
a panel of 46 observations from 1963 to 1992
Format
A data frame containing :
state
state abbreviation
year
the year
price
price per pack of cigarettes
pop
population
pop16
population above the age of 16岁以上人口总数
cpi
consumer price index (1983=100) 消费者价格指数
ndi
per capita disposable income 人均可支配收入
sales
cigarette sales in packs per capita 人均卷烟销售量(包)
pimin
minimum price in adjoining states per pack of cigarettes 相邻州每包香烟的最低价格
Details
total number of observations : 1380
observation : regional
country : United States
Source
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
References
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
Baltagi B, Levin D (1992). “Cigarette taxation: Raising revenues and reducing consumption.” Structural Change and Economic Dynamics, 3(2), 321-335. https://EconPapers.repec.org/RePEc:eee:streco✌️3:y:1992:i:2:p:321-335.
Baltagi BH, Griffin JM, Xiong W (2000). “To Pool or Not to Pool: Homogeneous Versus Heterogeneous Estimators Applied to Cigarette Demand.” The Review of Economics and Statistics, 82(1), 117-126. doi: 10.1162/003465300558551, https://doi.org/10.1162/003465300558551.
2 Crime in North Carolina
data(Crime)
Description
a panel of 90 observational units (counties) from 1981 to 1987
Format
A data frame containing :
county
county identifier
year
year from 1981 to 1987
crmrte
crimes committed per person
prbarr
‘probability’ of arrest
prbconv
‘probability’ of conviction
prbpris
‘probability’ of prison sentence
avgsen
average sentence, days
polpc
police per capita
density
people per square mile
taxpc
tax revenue per capita
region
factor. One of ‘other’, ‘west’ or ‘central’.
smsa
factor. (Also called “urban”.) Does the individual reside in a SMSA (standard metropolitan statistical area)?
pctmin
percentage minority in 1980
wcon
weekly wage in construction
wtuc
weekly wage in transportation, utilities, communications
wtrd
weekly wage in wholesale and retail trade
wfir
weekly wage in finance, insurance and real estate
wser
weekly wage in service industry
wmfg
weekly wage in manufacturing
wfed
weekly wage in federal government
wsta
weekly wage in state government
wloc
weekly wage in local government
mix
offence mix: face-to-face/other
pctymle
percentage of young males (between ages 15 to 24)
lcrmrte
log of crimes committed per person
lprbarr
log of ‘probability’ of arrest
lprbconv
log of ‘probability’ of conviction
lprbpris
log of ‘probability’ of prison sentence
lavgsen
log of average sentence, days
lpolpc
log of police per capita
ldensity
log of people per square mile
ltaxpc
log of tax revenue per capita
lpctmin
log of percentage minority in 1980
lwcon
log of weekly wage in construction
lwtuc
log of weekly wage in transportation, utilities, communications
lwtrd
log of weekly wage in wholesale and retail trade
lwfir
log of weekly wage in finance, insurance and real estate
lwser
log of weekly wage in service industry
lwmfg
log of weekly wage in manufacturing
lwfed
log of weekly wage in federal government
lwsta
log of weekly wage in state government
lwloc
log of weekly wage in local government
lmix
log of offence mix: face-to-face/other
lpctymle
log of percentage of young males (between ages 15 to 24)
Details
total number of observations : 630
observation : regional
country : United States
The variables l* (lcrmrte, lprbarr, …) contain the pre-computed logarithms of the base variables as found in the original data set. Note that these values slightly differ from what R’s log() function yields for the base variables. In order to reproduce examples from the literature, the pre-computed logs need to be used, otherwise the results differ slightly.
Source
Journal of Applied Econometrics Data Archive (complements Baltagi (2006)):
http://qed.econ.queensu.ca/jae/2006-v21.4/baltagi/
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
See also Journal of Applied Econometrics data archive entry for Baltagi (2006) at http://qed.econ.queensu.ca/jae/2006-v21.4/baltagi/.
References
Cornwell C, Trumbull WN (1994). “Estimating the economic model of crime with panel data.” Review of Economics and Statistics, 76, 360–366.
Baltagi BH (2006). “Estmating an economic model of crime using panel data from North Carolina.” Journal of Applied Econometrics, 21(4).
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
3 Gasoline Consumption
data(Gasoline)
Description
A panel of 18 observations from 1960 to 1978
Format
A data frame containing :
country
a factor with 18 levels
year
the year
lgaspcar
logarithm of motor gasoline consumption per car
lincomep
logarithm of real per-capita income
lrpmg
logarithm of real motor gasoline price
lcarpcap
logarithm of the stock of cars per capita
Details
total number of observations : 342
observation : country
country : OECD
Source
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
References
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
Baltagi BH, Griffin JM (1983). “Gasoline demand in the OECD: An application of pooling and testing procedures.” European Economic Review, 22(2), 117 - 137. ISSN 0014-2921, https://www.sciencedirect.com/science/article/pii/0014292183900776.
4 Grunfeld’s Investment Data
data(Grunfeld)
Description
A balanced panel of 10 observational units (firms) from 1935 to 1954
Format
A data frame containing :
firm
observation
year
date
inv
gross Investment
value
value of the firm
capital
stock of plant and equipment
Details
total number of observations : 200
observation : production units
country : United States
Note
The Grunfeld data as provided in package plm is the same data as used in Baltagi (2001), see Examples below.
NB:
Various versions of the Grunfeld data circulate online. Also, various text books (and also varying among editions) and papers use different subsets of the original Grunfeld data, some of which contain errors in a few data points compared to the original data used by Grunfeld (1958) in his PhD thesis. See Kleiber/Zeileis (2010) and its accompanying website for a comparison of various Grunfeld data sets in use.
Source
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
https://www.wiley.com/legacy/wileychi/baltagi/supp/Grunfeld.fil
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
References
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
Grunfeld Y (1958). The determinants of corporate investment. Ph.D. thesis, Department of Economics, University of Chicago.
Kleiber C, Zeileis A (2010). “The Grunfeld Data at 50.” German Economic Review, 11, 404-417. https://doi.org/10.1111/j.1468-0475.2010.00513.x.
website accompanying the paper with various variants of the Grunfeld data: https://www.zeileis.org/grunfeld/.
5 Wages and Hours Worked
data(LaborSupply)
Description
A panel of 532 observations from 1979 to 1988
Format
A data frame containing :
lnhr
log of annual hours worked
lnwg
log of hourly wage
kids
number of children
age
age
disab
bad health
id
id
year
year
Details
number of observations : 5320
Source
Online complements to Ziliak (1997).
Journal of Business Economics and Statistics web site: https://amstat.tandfonline.com/loi/ubes20/.
References
Colin Cameron A, K. Trivedi P (2005). Microeconometrics: Methods and Applications. Cambridge University Press. ISBN 0521848059, doi: 10.1017/CBO9780511811241.
Ziliak JP (1997). “Efficient Estimation with Panel Data When Instruments Are Predetermined: An Empirical Comparison of Moment-Condition Estimators.” Journal of Business & Economic Statistics, 15(4), 419–431. ISSN 07350015.
6 Wages and Education of Young Males
data(Males)
Description
A panel of 545 observations from 1980 to 1987
Format
A data frame containing :
nr
identifier
year
year
school
years of schooling
exper
years of experience (computed as age-6-school)
union
wage set by collective bargaining?
ethn
a factor with levels black, hisp, other
married
married?
health
health problem?
wage
log of hourly wage
industry
a factor with 12 levels
occupation
a factor with 9 levels
residence
a factor with levels rural_area, north_east, northern_central, south
Details
total number of observations : 4360
observation : individuals
country : United States
Source
Journal of Applied Econometrics data archive http://qed.econ.queensu.ca/jae/1998-v13.2/vella-verbeek/.
References
Vella F, Verbeek M (1998). “Whose wages do unions raise? A dynamic model of unionism and wage rate determination for young men.” Journal of Applied Econometrics, 13, 163–183.
Verbeek M (2004). A Guide to Modern Econometrics, 2nd edition. Wiley.
7 Purchasing Power Parity and other parity relationships
data(Parity )
Description
A panel of 104 quarterly observations from 1973Q1 to 1998Q4
Format
A data frame containing :
country
country codes: a factor with 17 levels
time
the quarter index, 1973Q1-1998Q4
ls
log spot exchange rate vs. USD
lp
log price level
is
short term interest rate
il
long term interest rate
ld
log price differential vs. USA
uis
U.S. short term interest rate
uil
U.S. long term interest rate
Details
total number of observations : 1768
observation : country
country : OECD
Source
Coakley J, Fuertes A, Smith R (2006). “Unobserved heterogeneity in panel time series models.” Computational Statistics & Data Analysis, 50(9), 2361–2380.
References
Coakley J, Fuertes A, Smith R (2006). “Unobserved heterogeneity in panel time series models.” Computational Statistics & Data Analysis, 50(9), 2361–2380.
Driscoll JC, Kraay AC (1998). “Consistent covariance matrix estimation with spatially dependent panel data.” Review of economics and statistics, 80(4), 549–560.
8 US States Production
data(Produc)
Description
A panel of 48 observations from 1970 to 1986
Format
A data frame containing :
state
the state
year
the year
region
the region
pcap
public capital stock
hwy
highway and streets
water
water and sewer facilities
util
other public buildings and structures
pc
private capital stock
gsp
gross state product
emp
labor input measured by the employment in non–agricultural payrolls
unemp
state unemployment rate
Details
total number of observations : 816
observation : regional
country : United States
Source
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
References
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
Baltagi BH, Pinnoi N (1995). “Public capital stock and state productivity growth: further evidence from an error components model.” Empirical Economics, 20, 351-359.
Munnell A (1990). “Why Has Productivity Growth Declined? Productivity and Public Investment.” New England Economic Review, 3–22.
9 Production of Rice in Indonesia
data(RiceFarms)
Description
a panel of 171 observations
Format
A dataframe containing :
id
the farm identifier
size
the total area cultivated with rice, measured in hectares
status
land status, on of ‘owner’ (non sharecroppers, owner operators or leaseholders or both), ‘share’ (sharecroppers), ‘mixed’ (mixed of the two previous status)
varieties
one of ‘trad’ (traditional varieties), ‘high’ (high yielding varieties) and ‘mixed’ (mixed varieties)
bimas
bIMAS is an intensification program; one of ‘no’ (non-bimas farmer), ‘yes’ (bimas farmer) or ‘mixed’ (part but not all of farmer’s land was registered to be in the bimas program)
seed
seed in kilogram
urea
urea in kilogram
phosphate
phosphate in kilogram
pesticide
pesticide cost in Rupiah
pseed
price of seed in Rupiah per kg
purea
price of urea in Rupiah per kg
pphosph
price of phosphate in Rupiah per kg
hiredlabor
hired labor in hours
famlabor
family labor in hours
totlabor
total labor (excluding harvest labor)
wage
labor wage in Rupiah per hour
goutput
gross output of rice in kg
noutput
net output, gross output minus harvesting cost (paid in terms of rice)
price
price of rough rice in Rupiah per kg
region
one of ‘wargabinangun’, ‘langan’, ‘gunungwangi’, ‘malausma’, ‘sukaambit’, ‘ciwangi’
Details
number of observations : 1026
observation : farms
country : Indonesia
Source
Feng Q, Horrace WC (2012). “Alternative technical efficiency measures: Skew, bias and scale.” Journal of Applied Econometrics, 27(2), 253-268. doi: 10.1002/jae.1190, https://onlinelibrary.wiley.com/doi/pdf/10.1002/jae.1190, https://onlinelibrary.wiley.com/doi/abs/10.1002/jae.1190.
10 Employment and Wages in Spain
data(Snmesp)
Description
A panel of 738 observations from 1983 to 1990
Format
A data frame containing:
firm
firm index
year
year
n
log of employment
w
log of wages
y
log of real output
i
log of intermediate inputs
k
log of real capital stock
f
real cash flow
Details
total number of observations: 5904
observation: firms
country: Spain
Source
Journal of Business Economics and Statistics data archive:
https://amstat.tandfonline.com/loi/ubes20/.
References
Alonso-Borrego C, Arellano M (1999). “Symmetrically Normalized Instrumental-Variable Estimation Using Panel Data.” Journal of Business and Economic Statistics, 17(1), 36-49.
11 The Penn World Table, v. 5
data(SumHes)
Description
A panel of 125 observations from 1960 to 1985
Format
A data frame containing :
year
the year
country
the country name (factor)
opec
OPEC member?
com
communist regime?
pop
country’s population (in thousands)
gdp
real GDP per capita (in 1985 US dollars)
sr
saving rate (in percent)
Details
total number of observations : 3250
observation : country
country : World
Source
Online supplements to Hayashi (2000).
http://fhayashi.fc2web.com/datasets.htm
References
Hayashi F (2000). Econometrics. Princeton University Press.
Summers R, Heston A (1991). “The Penn World Table (Mark 5): An Expanded Set of International Comparisons, 1950–1988.” The Quarterly Journal of Economics, 106, 327-68. doi: 10.2307/2937941.
12 Panel Data of Individual Wages
data(Wages)
Description
A panel of 595 individuals from 1976 to 1982, taken from the Panel Study of Income Dynamics (PSID).
The data are organized as a stacked time series/balanced panel, see Examples on how to convert to a pdata.frame.
Format
A data frame containing:
exp
years of full-time work experience.
wks
weeks worked.
bluecol
blue collar?
ind
works in a manufacturing industry?
south
resides in the south?
smsa
resides in a standard metropolitan statistical area?
married
married?
sex
a factor with levels “male” and “female”
union
individual’s wage set by a union contract?
ed
years of education.
black
is the individual black?
lwage
logarithm of wage.
Details
total number of observations : 4165
observation : individuals
country : United States
Source
Online complements to Baltagi (2001):
https://www.wiley.com/legacy/wileychi/baltagi/
Online complements to Baltagi (2013):
https://bcs.wiley.com/he-bcs/Books?action=resource&bcsId=4338&itemId=1118672321&resourceId=13452
References
Baltagi BH (2001). Econometric Analysis of Panel Data, 3rd edition. John Wiley and Sons ltd.
Baltagi BH (2013). Econometric Analysis of Panel Data, 5th edition. John Wiley and Sons ltd.
Cornwell C, Rupert P (1988). “Efficient Estimation With Panel Data: an Empirical Comparison of Instrumental Variables Estimators.” Journal of Applied Econometrics, 3, 149–155.
经典的面板数据集(R语言包plm)相关推荐
- r语言 面板数据回归_工具方法 | “名牌包”:面板、时间序列模型常用R语言包...
计量经济学是数学.统计技术和经济分析的综合,即运用数学.统计方法和相关经济理论,通过计量模型来揭示经济数量关系和规律.R语言包,已经实现了现代计量经济学的很多统计分析功能,下面从面板数据模型和时间序列 ...
- R语言包在linux上的安装等知识
有关install.packages()函数的详见:R包 package 的安装(install.packages函数详解) R的包(package)通常有两种: 1 binary package:这 ...
- 在Linux中安装R语言包,遇到无法验证下列签名的错误
R语言包的安装方式如下 Ubuntu自带的APT包管理器中的R安装包总是落后于标准版,因此需要添加新的镜像源把APT包管理中的R安装包更新到最新版. 请登录Linux系统,打开一个终端,然后执行下面命 ...
- 安装低版本的R语言、和自行下载安装各个版本的R语言包、以及多环境运行R
1. 下载并安装最新的R版本软件,以国内的清华的镜像为例(国内镜像速度快) https://mirrors.tuna.tsinghua.edu.cn/CRAN/ 2. 官网上如何下载老版本或低版本(旧 ...
- python调用r语言加载包错误_Python中调用R语言包指南.docx
Python中调用R语言包指南R语言是非常强大的做统计分析和建模方面的开源软件,它有非常丰富的统计软件包,做统计可以说只有你想不到的,没有R办不到的.Python又是当下最流行的编程软件之一,Pyth ...
- linux下离线安装r语言包,CentOS下离线安装R语言Forecast包
1.下载Forecast包以及其依赖包,执行下面R脚本: getPackages packages tools::package_dependencies(packs, available.packa ...
- 10个令人相见恨晚的R语言包
大数据热火朝天的时代,大数据技术也成为香饽饽,收到研发的青睐.下面主要为大家介绍R语言.大约3年前我开始使用R,起初进展很慢,与我习惯的语言相比,语法更加直观也比较简单,而且需要一段时间才能习惯于细微 ...
- r语言remarkdown展示图_使用R语言包circlize可视化展示blast双序列比对结果
circlize这个包还挺强大的,R语言里用来画圈图还挺方便的. 今天这篇文章记录用circlize这个包画圈图展示blast双序列比对结果的代码 植物线粒体基因组类的文章通常会分析细胞器基因组间基因 ...
- R语言-包的安装、载入及使用方法
一.原理简述 包是R函数.数据.预编译代码以一种定义完善的格式组成的集合.计算机上存储包的目录称为库(library).函数.libPaths()能够显示库所在的位置,函数library()则可以显示 ...
- r语言和metawin_Windows下使用Rtools编译R语言包
使用devtools安装github中的R源代码时,经常会出各种错误,索性搜了一下怎么在Windows下直接打包,网上的资料也是参差不齐,以下是自己验证通过的. 一.下载Rtools 根据自己安装的R ...
最新文章
- 深度学习如何又好又快? Google发布最新《高效深度学习: 更小、更快、更好》综述...
- UNIX再学习 -- XSI IPC通信方式
- python简单可视化聊天界面_如何用Python制作可视化输入界面
- POJ2718【DFS】
- php删除oracle数据记录日志文件,Oracle手动切换日志文件和清空日志文件
- 加载mySQL数据到内存_【测试验证】数据库加载到内存占用大小
- python列表添加字符串_python字符串和列表操作
- python中的struct
- objective-C 的OOP(上)--类定义、继承及方法调用
- MyEclipse8.5添加tomcat7
- x299服务器芯片,X299比X399差在哪? 两大平台产品比拼
- unity如何实现图片透视_实战 | 自己实现扫描全能王
- 自定义组件时 Binary XML file line Error inflating class 异常
- IT自学网有视频教程
- 消费新品周报 | 奔富发布2019珍藏系列16款葡萄酒;欧乐-B推出AI电动牙刷
- SVG基本使用(三、剪裁/蒙版、渐变色、画笔、形变、ViewBox)
- 【干货分享|建议收藏】2w字爆肝详解 JavaScript对象
- elasticsearch你了解多少?
- RecyclerView中嵌套Edittext,添加监听的问题
- 趁表弟上厕所,我复习一下用CSS设置文字文本样式
热门文章
- 开源OLAP系统对比
- Windows系统开机自动运行程序和自动启动服务
- P3554 [POI2013]LUK-Triumphal arch
- 离职后的总结与感想(一)
- 这样做,才能留住员工的心
- android:scheme 常用类型,android scheme
- VMware Workstation安装windows xp系统并创建虚拟软盘
- C语言英尺和英寸换算米
- 宝宝专业智力测试软件,儿童医院心理科做智商检测-儿童智力测试仪_儿童注意力测试仪_儿童综合素质测试仪/测评系统...
- matlab作图excel,Excel表格数据怎么画matlab-如何用matlab把excel数据画成图像