美国人口分析

  • 读取csv文件中的数据
  • 使用merge进行数据融合
    • 当需要级联的属性名相同时
    • 当需要级联的属性名不同时
  • 删除一列数据:drop
  • 查看空数据,并根据不同情况进行相应处理
    • 数据清洗
      • 当空数据比例高时,对空数据进行赋值操作
      • 当空数据比例低,且难以赋值时,对数据进行删除操作
  • 级联之后的分析操作
    • 查看是否包含空数据,进行数据清洗
  • 对人口密度进行计算、级联
  • 用类似sql的功能进行数据查找

读取csv文件中的数据

要从csv类型文件中读取数据到pandas中,可以使用read_csv命令

  • 输入
import numpy as np
import pandas as pd
from pandas import Series,DataFrameareas = pd.read_csv('./state-areas.csv')
areas
areas.shapeabb = pd.read_csv('./state-abbrevs.csv')
abb
abb.shapepop = pd.read_csv('./state-population.csv')
pop
pop.shape
  • 输出
state    area (sq. mi)
0   Alabama 52423
1   Alaska  656425
2   Arizona 114006
3   Arkansas    53182
4   California  163707
5   Colorado    104100
6   Connecticut 5544
7   Delaware    1954
8   Florida 65758
9   Georgia 59441
10  Hawaii  10932
11  Idaho   83574
12  Illinois    57918
13  Indiana 36420
14  Iowa    56276
15  Kansas  82282
16  Kentucky    40411
17  Louisiana   51843
18  Maine   35387
19  Maryland    12407
20  Massachusetts   10555
21  Michigan    96810
22  Minnesota   86943
23  Mississippi 48434
24  Missouri    69709
25  Montana 147046
26  Nebraska    77358
27  Nevada  110567
28  New Hampshire   9351
29  New Jersey  8722
30  New Mexico  121593
31  New York    54475
32  North Carolina  53821
33  North Dakota    70704
34  Ohio    44828
35  Oklahoma    69903
36  Oregon  98386
37  Pennsylvania    46058
38  Rhode Island    1545
39  South Carolina  32007
40  South Dakota    77121
41  Tennessee   42146
42  Texas   268601
43  Utah    84904
44  Vermont 9615
45  Virginia    42769
46  Washington  71303
47  West Virginia   24231
48  Wisconsin   65503
49  Wyoming 97818
50  District of Columbia    68
51  Puerto Rico 3515(52, 2)state    abbreviation
0   Alabama AL
1   Alaska  AK
2   Arizona AZ
3   Arkansas    AR
4   California  CA
5   Colorado    CO
6   Connecticut CT
7   Delaware    DE
8   District of Columbia    DC
9   Florida FL
10  Georgia GA
11  Hawaii  HI
12  Idaho   ID
13  Illinois    IL
14  Indiana IN
15  Iowa    IA
16  Kansas  KS
17  Kentucky    KY
18  Louisiana   LA
19  Maine   ME
20  Montana MT
21  Nebraska    NE
22  Nevada  NV
23  New Hampshire   NH
24  New Jersey  NJ
25  New Mexico  NM
26  New York    NY
27  North Carolina  NC
28  North Dakota    ND
29  Ohio    OH
30  Oklahoma    OK
31  Oregon  OR
32  Maryland    MD
33  Massachusetts   MA
34  Michigan    MI
35  Minnesota   MN
36  Mississippi MS
37  Missouri    MO
38  Pennsylvania    PA
39  Rhode Island    RI
40  South Carolina  SC
41  South Dakota    SD
42  Tennessee   TN
43  Texas   TX
44  Utah    UT
45  Vermont VT
46  Virginia    VA
47  Washington  WA
48  West Virginia   WV
49  Wisconsin   WI
50  Wyoming WY
(51, 2)state/region ages    year    population
0   AL  under18 2012    1117489.0
1   AL  total   2012    4817528.0
2   AL  under18 2010    1130966.0
3   AL  total   2010    4785570.0
4   AL  under18 2011    1125763.0
5   AL  total   2011    4801627.0
6   AL  total   2009    4757938.0
7   AL  under18 2009    1134192.0
8   AL  under18 2013    1111481.0
9   AL  total   2013    4833722.0
10  AL  total   2007    4672840.0
11  AL  under18 2007    1132296.0
12  AL  total   2008    4718206.0
13  AL  under18 2008    1134927.0
14  AL  total   2005    4569805.0
15  AL  under18 2005    1117229.0
16  AL  total   2006    4628981.0
17  AL  under18 2006    1126798.0
18  AL  total   2004    4530729.0
19  AL  under18 2004    1113662.0
20  AL  total   2003    4503491.0
21  AL  under18 2003    1113083.0
22  AL  total   2001    4467634.0
23  AL  under18 2001    1120409.0
24  AL  total   2002    4480089.0
25  AL  under18 2002    1116590.0
26  AL  under18 1999    1121287.0
27  AL  total   1999    4430141.0
28  AL  total   2000    4452173.0
29  AL  under18 2000    1122273.0
... ... ... ... ...
2514    USA under18 1999    71946051.0
2515    USA total   2000    282162411.0
2516    USA under18 2000    72376189.0
2517    USA total   1999    279040181.0
2518    USA total   2001    284968955.0
2519    USA under18 2001    72671175.0
2520    USA total   2002    287625193.0
2521    USA under18 2002    72936457.0
2522    USA total   2003    290107933.0
2523    USA under18 2003    73100758.0
2524    USA total   2004    292805298.0
2525    USA under18 2004    73297735.0
2526    USA total   2005    295516599.0
2527    USA under18 2005    73523669.0
2528    USA total   2006    298379912.0
2529    USA under18 2006    73757714.0
2530    USA total   2007    301231207.0
2531    USA under18 2007    74019405.0
2532    USA total   2008    304093966.0
2533    USA under18 2008    74104602.0
2534    USA under18 2013    73585872.0
2535    USA total   2013    316128839.0
2536    USA total   2009    306771529.0
2537    USA under18 2009    74134167.0
2538    USA under18 2010    74119556.0
2539    USA total   2010    309326295.0
2540    USA under18 2011    73902222.0
2541    USA total   2011    311582564.0
2542    USA under18 2012    73708179.0
2543    USA total   2012    313873685.0
2544 rows × 4 columns(2544, 4)

使用merge进行数据融合

merge的使用场合:pop人口数据,数量特别多(包含历年各州数据),abbrevs数据少(等于美国州的数量)。可以利用pop中的state/region和abbrevs中的abbreviation建立对应关系,此时只能使用merge进行融合

当需要级联的属性名相同时

  • 输入
pop2 = pop.merge(abb,how = 'outer',left_on = 'state/region',right_on = 'abbreviation')
pop2.head()
pop2.shape
  • 输出
state/region ages    year    population  state   abbreviation
0   AL  under18 2012    1117489.0   Alabama AL
1   AL  total   2012    4817528.0   Alabama AL
2   AL  under18 2010    1130966.0   Alabama AL
3   AL  total   2010    4785570.0   Alabama AL
4   AL  under18 2011    1125763.0   Alabama ALpop2.shape

当需要级联的属性名不同时

  • 如果属性名字不同,那么我们需要告诉级联方法,级联时,分别根据哪个属性进行合并,left_on,right_on,merge融合方式
  • inner内连接(两个DataFrame都有keys时才会保留),outer(无论有没有都保留),outer连接融合不会出现数据丢失的情况,但可能出现空值。

删除一列数据:drop

  • 输入
pop2.drop(labels = 'abbreviation',axos = 1, inplace =True)
pop2.head()
  • 输出
state/region ages    year    population  state
0   AL  under18 2012    1117489.0   Alabama
1   AL  total   2012    4817528.0   Alabama
2   AL  under18 2010    1130966.0   Alabama
3   AL  total   2010    4785570.0   Alabama
4   AL  under18 2011    1125763.0   Alabama
  • 写inplace = True的作用是防止数据自动输出占用内存

查看空数据,并根据不同情况进行相应处理

  • 查看哪些数据为空,给判断:state这一列isnull().boolean类型的值,Series
  • 使用条件进行数据的检索,DataFrame的高级功能,DataFrame索引和切片操作,loc,iloc
  • df[cond] cond是条件,Series
  • 去重操作,保留非重复值:pop2[cond][‘state/region’].unique()

输入

pop2.isnull().any()
# 定位为空的数据
cond = pop2['state'].isnull()
cond
# 返回的数据只有当state为空时,返回,为空时True
pop2[cond]
# 去重操作,保留非重复值,查看有哪些州的人口数为空值
pop2[cond]['state/region'].unique()

输出

state/region    False
ages            False
year            False
population       True
state            True
dtype: bool0       False
1       False
2       False
3       False
4       False
5       False
6       False
7       False
8       False
9       False
10      False
11      False
12      False
13      False
14      False
15      False
16      False
17      False
18      False
19      False
20      False
21      False
22      False
23      False
24      False
25      False
26      False
27      False
28      False
29      False...
2514     True
2515     True
2516     True
2517     True
2518     True
2519     True
2520     True
2521     True
2522     True
2523     True
2524     True
2525     True
2526     True
2527     True
2528     True
2529     True
2530     True
2531     True
2532     True
2533     True
2534     True
2535     True
2536     True
2537     True
2538     True
2539     True
2540     True
2541     True
2542     True
2543     True
Name: state, Length: 2544, dtype: boolstate/region  ages    year    population  state
2448    PR  under18 1990    NaN NaN
2449    PR  total   1990    NaN NaN
2450    PR  total   1991    NaN NaN
2451    PR  under18 1991    NaN NaN
2452    PR  total   1993    NaN NaN
2453    PR  under18 1993    NaN NaN
2454    PR  under18 1992    NaN NaN
2455    PR  total   1992    NaN NaN
2456    PR  under18 1994    NaN NaN
2457    PR  total   1994    NaN NaN
2458    PR  total   1995    NaN NaN
2459    PR  under18 1995    NaN NaN
2460    PR  under18 1996    NaN NaN
2461    PR  total   1996    NaN NaN
2462    PR  under18 1998    NaN NaN
2463    PR  total   1998    NaN NaN
2464    PR  total   1997    NaN NaN
2465    PR  under18 1997    NaN NaN
2466    PR  total   1999    NaN NaN
2467    PR  under18 1999    NaN NaN
2468    PR  total   2000    3810605.0   NaN
2469    PR  under18 2000    1089063.0   NaN
2470    PR  total   2001    3818774.0   NaN
2471    PR  under18 2001    1077566.0   NaN
2472    PR  total   2002    3823701.0   NaN
2473    PR  under18 2002    1065051.0   NaN
2474    PR  total   2004    3826878.0   NaN
2475    PR  under18 2004    1035919.0   NaN
2476    PR  total   2003    3826095.0   NaN
2477    PR  under18 2003    1050615.0   NaN
... ... ... ... ... ...
2514    USA under18 1999    71946051.0  NaN
2515    USA total   2000    282162411.0 NaN
2516    USA under18 2000    72376189.0  NaN
2517    USA total   1999    279040181.0 NaN
2518    USA total   2001    284968955.0 NaN
2519    USA under18 2001    72671175.0  NaN
2520    USA total   2002    287625193.0 NaN
2521    USA under18 2002    72936457.0  NaN
2522    USA total   2003    290107933.0 NaN
2523    USA under18 2003    73100758.0  NaN
2524    USA total   2004    292805298.0 NaN
2525    USA under18 2004    73297735.0  NaN
2526    USA total   2005    295516599.0 NaN
2527    USA under18 2005    73523669.0  NaN
2528    USA total   2006    298379912.0 NaN
2529    USA under18 2006    73757714.0  NaN
2530    USA total   2007    301231207.0 NaN
2531    USA under18 2007    74019405.0  NaN
2532    USA total   2008    304093966.0 NaN
2533    USA under18 2008    74104602.0  NaN
2534    USA under18 2013    73585872.0  NaN
2535    USA total   2013    316128839.0 NaN
2536    USA total   2009    306771529.0 NaN
2537    USA under18 2009    74134167.0  NaN
2538    USA under18 2010    74119556.0  NaN
2539    USA total   2010    309326295.0 NaN
2540    USA under18 2011    73902222.0  NaN
2541    USA total   2011    311582564.0 NaN
2542    USA under18 2012    73708179.0  NaN
2543    USA total   2012    313873685.0 NaN
96 rows × 5 columns
array(['PR', 'USA'], dtype=object)

数据清洗

当空数据比例高时,对空数据进行赋值操作

  • 输入
# 找到哪些数据为空数据
cond = pop2['state/region'] == 'PR'
cond
# 赋值操作
pop2['state'][cond] = 'Puerto Rice'
cond = pop2['state/region'] == 'USA'
pop2['state'][cond]='United State'pop2.isnull().any()
  • 输出
0       False
1       False
2       False
3       False
4       False
5       False
6       False
7       False
8       False
9       False
10      False
11      False
12      False
13      False
14      False
15      False
16      False
17      False
18      False
19      False
20      False
21      False
22      False
23      False
24      False
25      False
26      False
27      False
28      False
29      False...
2514    False
2515    False
2516    False
2517    False
2518    False
2519    False
2520    False
2521    False
2522    False
2523    False
2524    False
2525    False
2526    False
2527    False
2528    False
2529    False
2530    False
2531    False
2532    False
2533    False
2534    False
2535    False
2536    False
2537    False
2538    False
2539    False
2540    False
2541    False
2542    False
2543    False
Name: state/region, Length: 2544, dtype: boolstate/region    False
ages            False
year            False
population       True
state           False
dtype: bool

当空数据比例低,且难以赋值时,对数据进行删除操作

  • pop2[‘state’][cond]='Puerto Rico’对空数据进行赋值
    #将难于进行补全的空数据进行删除,前提是空数据比例很少
  • pop2.dropna(inplace = True)

输入

# 查找,定位空数据
cond = pop2['population'].isnull()
pop[cond]
pop2[cond].shape
# 将难于进行补全的数据进行删除
pop2.dropna(inplace = True)
pop2.shape
pop2.isnull().any()
pop2.notnull().all()

输出

 state/region    ages    year    population
2448    PR  under18 1990    NaN
2449    PR  total   1990    NaN
2450    PR  total   1991    NaN
2451    PR  under18 1991    NaN
2452    PR  total   1993    NaN
2453    PR  under18 1993    NaN
2454    PR  under18 1992    NaN
2455    PR  total   1992    NaN
2456    PR  under18 1994    NaN
2457    PR  total   1994    NaN
2458    PR  total   1995    NaN
2459    PR  under18 1995    NaN
2460    PR  under18 1996    NaN
2461    PR  total   1996    NaN
2462    PR  under18 1998    NaN
2463    PR  total   1998    NaN
2464    PR  total   1997    NaN
2465    PR  under18 1997    NaN
2466    PR  total   1999    NaN
2467    PR  under18 1999    NaN(20, 5)
(2524, 5)state/region    False
ages            False
year            False
population      False
state           False
dtype: boolstate/region    True
ages            True
year            True
population      True
state           True
dtype: bool

级联之后的分析操作

查看是否包含空数据,进行数据清洗

  • 将缺失的美国面积数据进行填充

输入

# 进行级联
pop3 = pop2.merge(areas, how = 'outer')
pop3.shape
pop3.head()
# 查看是否有空数据
pop3.isnull().any()
# 查看空数据位置
cond = pop3['area (sq. mi)'].isnull()
pop3[cond]
# 对空数据进行填充
a = areas['area (sq. mi)'].sum()
apop3['state'] == 'United State'
pop3['area (sq. mi)'][cond] = a
# 再次检查是否有空数据
pop3.notnull().all()

输出

(2524, 6)state/region    ages    year    population  state   area (sq. mi)
0   AL  under18 2012.0  1117489.0   Alabama 52423.0
1   AL  total   2012.0  4817528.0   Alabama 52423.0
2   AL  under18 2010.0  1130966.0   Alabama 52423.0
3   AL  total   2010.0  4785570.0   Alabama 52423.0
4   AL  under18 2011.0  1125763.0   Alabama 52423.0
state/region     False
ages             False
year             False
population       False
state            False
area (sq. mi)     True
dtype: bool2476 USA under18 1990    64218512.0  United State    NaN
2477    USA total   1990    249622814.0 United State    NaN
... ... ... ... ... ... ...
2494    USA under18 1999    71946051.0  United State    NaN
2495    USA total   2000    282162411.0 United State    NaN
2496    USA under18 2000    72376189.0  United State    NaN
2497    USA total   1999    279040181.0 United State    NaN
2498    USA total   2001    284968955.0 United State    NaN
2499    USA under18 2001    72671175.0  United State    NaN
2500    USA total   2002    287625193.0 United State    NaN
2501    USA under18 2002    72936457.0  United State    NaN
2502    USA total   2003    290107933.0 United State    NaN
2503    USA under18 2003    73100758.0  United State    NaN
2504    USA total   2004    292805298.0 United State    NaN
2505    USA under18 2004    73297735.0  United State    NaN
2506    USA total   2005    295516599.0 United State    NaN
2507    USA under18 2005    73523669.0  United State    NaN
2508    USA total   2006    298379912.0 United State    NaN
2509    USA under18 2006    73757714.0  United State    NaN
2510    USA total   2007    301231207.0 United State    NaN
2511    USA under18 2007    74019405.0  United State    NaN
2512    USA total   2008    304093966.0 United State    NaN
2513    USA under18 2008    74104602.0  United State    NaN
2514    USA under18 2013    73585872.0  United State    NaN
2515    USA total   2013    316128839.0 United State    NaN
2516    USA total   2009    306771529.0 United State    NaN
2517    USA under18 2009    74134167.0  United State    NaN
2518    USA under18 2010    74119556.0  United State    NaN
2519    USA total   2010    309326295.0 United State    NaN
2520    USA under18 2011    73902222.0  United State    NaN
2521    USA total   2011    311582564.0 United State    NaN
2522    USA under18 2012    73708179.0  United State    NaN
2523    USA total   2012    313873685.0 United State    NaN3786884state/region     True
ages             True
year             True
population       True
state            True
area (sq. mi)    True
dtype: bool

对人口密度进行计算、级联

  • 输入
pop_density = (pop3['population']/pop3['area (sq. mi)']).round(1)
pop_density
pop_density = DataFrame(pop_density)
pop_density
pop_density.columns = ['pop_density']
pop_density.head()
pop4 = pop3.merge(pop_density,left_index = True, right_index = True)
pop4.head()
  • 输出
0       21.3
1       91.9
2       21.6
3       91.3
4       21.5
5       91.6
6       90.8
7       21.6
8       21.2
9       92.2
10      89.1
11      21.6
12      90.0
13      21.6
14      87.2
15      21.3
16      88.3
17      21.5
18      86.4
19      21.2
20      85.9
21      21.2
22      85.2
23      21.4
24      85.5
25      21.3
26      21.4
27      84.5
28      84.9
29      21.4...
2494    19.0
2495    74.4
2496    19.1
2497    73.6
2498    75.2
2499    19.2
2500    75.9
2501    19.2
2502    76.5
2503    19.3
2504    77.2
2505    19.3
2506    78.0
2507    19.4
2508    78.7
2509    19.5
2510    79.5
2511    19.5
2512    80.2
2513    19.6
2514    19.4
2515    83.4
2516    80.9
2517    19.6
2518    19.6
2519    81.6
2520    19.5
2521    82.2
2522    19.4
2523    82.8
Length: 2524, dtype: float640
0   21.3
1   91.9
2   21.6
3   91.3
4   21.5
5   91.6
6   90.8
7   21.6
8   21.2
9   92.2
10  89.1
11  21.6
12  90.0
13  21.6
14  87.2
15  21.3
16  88.3
17  21.5
18  86.4
19  21.2
20  85.9
21  21.2
22  85.2
23  21.4
24  85.5
25  21.3
26  21.4
27  84.5
28  84.9
29  21.4
... ...
2494    19.0
2495    74.4
2496    19.1
2497    73.6
2498    75.2
2499    19.2
2500    75.9
2501    19.2
2502    76.5
2503    19.3
2504    77.2
2505    19.3
2506    78.0
2507    19.4
2508    78.7
2509    19.5
2510    79.5
2511    19.5
2512    80.2
2513    19.6
2514    19.4
2515    83.4
2516    80.9
2517    19.6
2518    19.6
2519    81.6
2520    19.5
2521    82.2
2522    19.4
2523    82.8
2524 rows × 1 columnspop_density
0   21.3
1   91.9
2   21.6
3   91.3
4   21.5state/region    ages    year    population  state   area (sq. mi)   pop_density
0   AL  under18 2012    1117489.0   Alabama 52423.0 21.3
1   AL  total   2012    4817528.0   Alabama 52423.0 91.9
2   AL  under18 2010    1130966.0   Alabama 52423.0 21.6
3   AL  total   2010    4785570.0   Alabama 52423.0 91.3
4   AL  under18 2011    1125763.0   Alabama 52423.0 21.5

用类似sql的功能进行数据查找

  • 输入
# 查找2012年美国各州全民人口数据
pop4['year'].unique()
pop4['ages'].unique()
pop5 = pop4.query("year == 2012 and ages == 'total'")
pop5
# 改变列索引
pop5 = pop5.set_index(keys = 'state/region')
pop5
# 对人口密度进行排序
pop5.sort_values(by = 'pop_density')
pop5.sort_values(by = 'pop_density',ascending = False)
  • 输出
array([2012, 2010, 2011, 2009, 2013, 2007, 2008, 2005, 2006, 2004, 2003,2001, 2002, 1999, 2000, 1998, 1997, 1996, 1995, 1994, 1993, 1992,1991, 1990], dtype=int64)
array(['under18', 'total'], dtype=object)state/region  ages    year    population  state   area (sq. mi)   pop_density
1   AL  total   2012    4817528.0   Alabama 52423.0 91.9
95  AK  total   2012    730307.0    Alaska  656425.0    1.1
97  AZ  total   2012    6551149.0   Arizona 114006.0    57.5
191 AR  total   2012    2949828.0   Arkansas    53182.0 55.5
193 CA  total   2012    37999878.0  California  163707.0    232.1
287 CO  total   2012    5189458.0   Colorado    104100.0    49.9
289 CT  total   2012    3591765.0   Connecticut 5544.0  647.9
383 DE  total   2012    917053.0    Delaware    1954.0  469.3
385 DC  total   2012    633427.0    District of Columbia    68.0    9315.1
479 FL  total   2012    19320749.0  Florida 65758.0 293.8
480 GA  total   2012    9915646.0   Georgia 59441.0 166.8
575 HI  total   2012    1390090.0   Hawaii  10932.0 127.2
576 ID  total   2012    1595590.0   Idaho   83574.0 19.1
671 IL  total   2012    12868192.0  Illinois    57918.0 222.2
672 IN  total   2012    6537782.0   Indiana 36420.0 179.5
767 IA  total   2012    3075039.0   Iowa    56276.0 54.6
768 KS  total   2012    2885398.0   Kansas  82282.0 35.1
863 KY  total   2012    4379730.0   Kentucky    40411.0 108.4
864 LA  total   2012    4602134.0   Louisiana   51843.0 88.8
959 ME  total   2012    1328501.0   Maine   35387.0 37.5
960 MD  total   2012    5884868.0   Maryland    12407.0 474.3
1055    MA  total   2012    6645303.0   Massachusetts   10555.0 629.6
1056    MI  total   2012    9882519.0   Michigan    96810.0 102.1
1151    MN  total   2012    5379646.0   Minnesota   86943.0 61.9
1152    MS  total   2012    2986450.0   Mississippi 48434.0 61.7
1247    MO  total   2012    6024522.0   Missouri    69709.0 86.4
1248    MT  total   2012    1005494.0   Montana 147046.0    6.8
1343    NE  total   2012    1855350.0   Nebraska    77358.0 24.0
1344    NV  total   2012    2754354.0   Nevada  110567.0    24.9
1439    NH  total   2012    1321617.0   New Hampshire   9351.0  141.3
1440    NJ  total   2012    8867749.0   New Jersey  8722.0  1016.7
1535    NM  total   2012    2083540.0   New Mexico  121593.0    17.1
1536    NY  total   2012    19576125.0  New York    54475.0 359.4
1631    NC  total   2012    9748364.0   North Carolina  53821.0 181.1
1632    ND  total   2012    701345.0    North Dakota    70704.0 9.9
1727    OH  total   2012    11553031.0  Ohio    44828.0 257.7
1728    OK  total   2012    3815780.0   Oklahoma    69903.0 54.6
1823    OR  total   2012    3899801.0   Oregon  98386.0 39.6
1824    PA  total   2012    12764475.0  Pennsylvania    46058.0 277.1
1919    RI  total   2012    1050304.0   Rhode Island    1545.0  679.8
1920    SC  total   2012    4723417.0   South Carolina  32007.0 147.6
2015    SD  total   2012    834047.0    South Dakota    77121.0 10.8
2016    TN  total   2012    6454914.0   Tennessee   42146.0 153.2
2111    TX  total   2012    26060796.0  Texas   268601.0    97.0
2112    UT  total   2012    2854871.0   Utah    84904.0 33.6
2207    VT  total   2012    625953.0    Vermont 9615.0  65.1
2208    VA  total   2012    8186628.0   Virginia    42769.0 191.4
2303    WA  total   2012    6895318.0   Washington  71303.0 96.7
2304    WV  total   2012    1856680.0   West Virginia   24231.0 76.6
2399    WI  total   2012    5724554.0   Wisconsin   65503.0 87.4
2400    WY  total   2012    576626.0    Wyoming 97818.0 5.9
2475    PR  total   2012    3651545.0   Puerto Rice 3790399.0   1.0
2523    USA total   2012    313873685.0 United State    3790399.0   82.8ages    year    population  state   area (sq. mi)   pop_density
state/region
AL  total   2012    4817528.0   Alabama 52423.0 91.9
AK  total   2012    730307.0    Alaska  656425.0    1.1
AZ  total   2012    6551149.0   Arizona 114006.0    57.5
AR  total   2012    2949828.0   Arkansas    53182.0 55.5
CA  total   2012    37999878.0  California  163707.0    232.1
CO  total   2012    5189458.0   Colorado    104100.0    49.9
CT  total   2012    3591765.0   Connecticut 5544.0  647.9
DE  total   2012    917053.0    Delaware    1954.0  469.3
DC  total   2012    633427.0    District of Columbia    68.0    9315.1
FL  total   2012    19320749.0  Florida 65758.0 293.8
GA  total   2012    9915646.0   Georgia 59441.0 166.8
HI  total   2012    1390090.0   Hawaii  10932.0 127.2
ID  total   2012    1595590.0   Idaho   83574.0 19.1
IL  total   2012    12868192.0  Illinois    57918.0 222.2
IN  total   2012    6537782.0   Indiana 36420.0 179.5
IA  total   2012    3075039.0   Iowa    56276.0 54.6
KS  total   2012    2885398.0   Kansas  82282.0 35.1
KY  total   2012    4379730.0   Kentucky    40411.0 108.4
LA  total   2012    4602134.0   Louisiana   51843.0 88.8
ME  total   2012    1328501.0   Maine   35387.0 37.5
MD  total   2012    5884868.0   Maryland    12407.0 474.3
MA  total   2012    6645303.0   Massachusetts   10555.0 629.6
MI  total   2012    9882519.0   Michigan    96810.0 102.1
MN  total   2012    5379646.0   Minnesota   86943.0 61.9
MS  total   2012    2986450.0   Mississippi 48434.0 61.7
MO  total   2012    6024522.0   Missouri    69709.0 86.4
MT  total   2012    1005494.0   Montana 147046.0    6.8
NE  total   2012    1855350.0   Nebraska    77358.0 24.0
NV  total   2012    2754354.0   Nevada  110567.0    24.9
NH  total   2012    1321617.0   New Hampshire   9351.0  141.3
NJ  total   2012    8867749.0   New Jersey  8722.0  1016.7
NM  total   2012    2083540.0   New Mexico  121593.0    17.1
NY  total   2012    19576125.0  New York    54475.0 359.4
NC  total   2012    9748364.0   North Carolina  53821.0 181.1
ND  total   2012    701345.0    North Dakota    70704.0 9.9
OH  total   2012    11553031.0  Ohio    44828.0 257.7
OK  total   2012    3815780.0   Oklahoma    69903.0 54.6
OR  total   2012    3899801.0   Oregon  98386.0 39.6
PA  total   2012    12764475.0  Pennsylvania    46058.0 277.1
RI  total   2012    1050304.0   Rhode Island    1545.0  679.8
SC  total   2012    4723417.0   South Carolina  32007.0 147.6
SD  total   2012    834047.0    South Dakota    77121.0 10.8
TN  total   2012    6454914.0   Tennessee   42146.0 153.2
TX  total   2012    26060796.0  Texas   268601.0    97.0
UT  total   2012    2854871.0   Utah    84904.0 33.6
VT  total   2012    625953.0    Vermont 9615.0  65.1
VA  total   2012    8186628.0   Virginia    42769.0 191.4
WA  total   2012    6895318.0   Washington  71303.0 96.7
WV  total   2012    1856680.0   West Virginia   24231.0 76.6
WI  total   2012    5724554.0   Wisconsin   65503.0 87.4
WY  total   2012    576626.0    Wyoming 97818.0 5.9
PR  total   2012    3651545.0   Puerto Rice 3790399.0   1.0
USA total   2012    313873685.0 United State    3790399.0   82.8ages    year    population  state   area (sq. mi)   pop_density
state/region
PR  total   2012    3651545.0   Puerto Rice 3790399.0   1.0
AK  total   2012    730307.0    Alaska  656425.0    1.1
WY  total   2012    576626.0    Wyoming 97818.0 5.9
MT  total   2012    1005494.0   Montana 147046.0    6.8
ND  total   2012    701345.0    North Dakota    70704.0 9.9
SD  total   2012    834047.0    South Dakota    77121.0 10.8
NM  total   2012    2083540.0   New Mexico  121593.0    17.1
ID  total   2012    1595590.0   Idaho   83574.0 19.1
NE  total   2012    1855350.0   Nebraska    77358.0 24.0
NV  total   2012    2754354.0   Nevada  110567.0    24.9
UT  total   2012    2854871.0   Utah    84904.0 33.6
KS  total   2012    2885398.0   Kansas  82282.0 35.1
ME  total   2012    1328501.0   Maine   35387.0 37.5
OR  total   2012    3899801.0   Oregon  98386.0 39.6
CO  total   2012    5189458.0   Colorado    104100.0    49.9
OK  total   2012    3815780.0   Oklahoma    69903.0 54.6
IA  total   2012    3075039.0   Iowa    56276.0 54.6
AR  total   2012    2949828.0   Arkansas    53182.0 55.5
AZ  total   2012    6551149.0   Arizona 114006.0    57.5
MS  total   2012    2986450.0   Mississippi 48434.0 61.7
MN  total   2012    5379646.0   Minnesota   86943.0 61.9
VT  total   2012    625953.0    Vermont 9615.0  65.1
WV  total   2012    1856680.0   West Virginia   24231.0 76.6
USA total   2012    313873685.0 United State    3790399.0   82.8
MO  total   2012    6024522.0   Missouri    69709.0 86.4
WI  total   2012    5724554.0   Wisconsin   65503.0 87.4
LA  total   2012    4602134.0   Louisiana   51843.0 88.8
AL  total   2012    4817528.0   Alabama 52423.0 91.9
WA  total   2012    6895318.0   Washington  71303.0 96.7
TX  total   2012    26060796.0  Texas   268601.0    97.0
MI  total   2012    9882519.0   Michigan    96810.0 102.1
KY  total   2012    4379730.0   Kentucky    40411.0 108.4
HI  total   2012    1390090.0   Hawaii  10932.0 127.2
NH  total   2012    1321617.0   New Hampshire   9351.0  141.3
SC  total   2012    4723417.0   South Carolina  32007.0 147.6
TN  total   2012    6454914.0   Tennessee   42146.0 153.2
GA  total   2012    9915646.0   Georgia 59441.0 166.8
IN  total   2012    6537782.0   Indiana 36420.0 179.5
NC  total   2012    9748364.0   North Carolina  53821.0 181.1
VA  total   2012    8186628.0   Virginia    42769.0 191.4
IL  total   2012    12868192.0  Illinois    57918.0 222.2
CA  total   2012    37999878.0  California  163707.0    232.1
OH  total   2012    11553031.0  Ohio    44828.0 257.7
PA  total   2012    12764475.0  Pennsylvania    46058.0 277.1
FL  total   2012    19320749.0  Florida 65758.0 293.8
NY  total   2012    19576125.0  New York    54475.0 359.4
DE  total   2012    917053.0    Delaware    1954.0  469.3
MD  total   2012    5884868.0   Maryland    12407.0 474.3
MA  total   2012    6645303.0   Massachusetts   10555.0 629.6
CT  total   2012    3591765.0   Connecticut 5544.0  647.9
RI  total   2012    1050304.0   Rhode Island    1545.0  679.8
NJ  total   2012    8867749.0   New Jersey  8722.0  1016.7
DC  total   2012    633427.0    District of Columbia    68.0    9315.1ages  year    population  state   area (sq. mi)   pop_density
state/region
DC  total   2012    633427.0    District of Columbia    68.0    9315.1
NJ  total   2012    8867749.0   New Jersey  8722.0  1016.7
RI  total   2012    1050304.0   Rhode Island    1545.0  679.8
CT  total   2012    3591765.0   Connecticut 5544.0  647.9
MA  total   2012    6645303.0   Massachusetts   10555.0 629.6
MD  total   2012    5884868.0   Maryland    12407.0 474.3
DE  total   2012    917053.0    Delaware    1954.0  469.3
NY  total   2012    19576125.0  New York    54475.0 359.4
FL  total   2012    19320749.0  Florida 65758.0 293.8
PA  total   2012    12764475.0  Pennsylvania    46058.0 277.1
OH  total   2012    11553031.0  Ohio    44828.0 257.7
CA  total   2012    37999878.0  California  163707.0    232.1
IL  total   2012    12868192.0  Illinois    57918.0 222.2
VA  total   2012    8186628.0   Virginia    42769.0 191.4
NC  total   2012    9748364.0   North Carolina  53821.0 181.1
IN  total   2012    6537782.0   Indiana 36420.0 179.5
GA  total   2012    9915646.0   Georgia 59441.0 166.8
TN  total   2012    6454914.0   Tennessee   42146.0 153.2
SC  total   2012    4723417.0   South Carolina  32007.0 147.6
NH  total   2012    1321617.0   New Hampshire   9351.0  141.3
HI  total   2012    1390090.0   Hawaii  10932.0 127.2
KY  total   2012    4379730.0   Kentucky    40411.0 108.4
MI  total   2012    9882519.0   Michigan    96810.0 102.1
TX  total   2012    26060796.0  Texas   268601.0    97.0
WA  total   2012    6895318.0   Washington  71303.0 96.7
AL  total   2012    4817528.0   Alabama 52423.0 91.9
LA  total   2012    4602134.0   Louisiana   51843.0 88.8
WI  total   2012    5724554.0   Wisconsin   65503.0 87.4
MO  total   2012    6024522.0   Missouri    69709.0 86.4
USA total   2012    313873685.0 United State    3790399.0   82.8
WV  total   2012    1856680.0   West Virginia   24231.0 76.6
VT  total   2012    625953.0    Vermont 9615.0  65.1
MN  total   2012    5379646.0   Minnesota   86943.0 61.9
MS  total   2012    2986450.0   Mississippi 48434.0 61.7
AZ  total   2012    6551149.0   Arizona 114006.0    57.5
AR  total   2012    2949828.0   Arkansas    53182.0 55.5
OK  total   2012    3815780.0   Oklahoma    69903.0 54.6
IA  total   2012    3075039.0   Iowa    56276.0 54.6
CO  total   2012    5189458.0   Colorado    104100.0    49.9
OR  total   2012    3899801.0   Oregon  98386.0 39.6
ME  total   2012    1328501.0   Maine   35387.0 37.5
KS  total   2012    2885398.0   Kansas  82282.0 35.1
UT  total   2012    2854871.0   Utah    84904.0 33.6
NV  total   2012    2754354.0   Nevada  110567.0    24.9
NE  total   2012    1855350.0   Nebraska    77358.0 24.0
ID  total   2012    1595590.0   Idaho   83574.0 19.1
NM  total   2012    2083540.0   New Mexico  121593.0    17.1
SD  total   2012    834047.0    South Dakota    77121.0 10.8
ND  total   2012    701345.0    North Dakota    70704.0 9.9
MT  total   2012    1005494.0   Montana 147046.0    6.8
WY  total   2012    576626.0    Wyoming 97818.0 5.9
AK  total   2012    730307.0    Alaska  656425.0    1.1
PR  total   2012    3651545.0   Puerto Rice 3790399.0   1.0

pandas美国人口分析实例相关推荐

  1. python pandas读取excel-Python使用Pandas读写Excel实例解析

    这篇文章主要介绍了Python使用Pandas读写Excel实例解析,文中通过示例代码介绍的非常详细,对大家的学习或者工作具有一定的参考学习价值,需要的朋友可以参考下 Pandas是python的一个 ...

  2. Pandas数据分析⑥——数据分析实例(货品送达率与合格率/返修率/拒收率)

    Pandas系列目录如下: Pandas数据分析①--数据读取(CSV/TXT/JSON) Pandas数据分析②--数据清洗(重复值/缺失值/异常值) Pandas数据分析③--数据规整1(索引和列 ...

  3. pandas实践——美国人口分析

    1.导入文件,并查看数据样本 abbr = pd.read_csv("./state-abbrevs.csv") areas =pd.read_csv("./state- ...

  4. python 数据分析模块_Python数据分析pandas模块用法实例详解

    本文实例讲述了Python数据分析pandas模块用法.分享给大家供大家参考,具体如下: pandas pandas10分钟入门,可以查看官网:10 minutes to pandas 也可以查看更复 ...

  5. pandas 数据查询实例

    import pandas as pdfpath = "D:\\python39\\pandas\\antlearnpandasmaster\\datas\\beijing_tianqi\\ ...

  6. Pandas的get_dummies 实例的应用

    Pandas的get_dummies what?:get_dummies是Pandas进行独热编码(One-Hot encode)的函数 上图的左边灰色框(python debug出来的)是pytho ...

  7. 【Pandas】Pandas的时间与日期

    注意:重要且关键章节已标星 目录 概述 时间戳与时间跨度 转换为时间戳 提供格式参数(指定时间格式)* 从多个DataFrame列组装日期时间 无效数据 纪元时间戳 从时间戳到纪元 使用origin参 ...

  8. Pandas基础操作(上)

    文章目录 一.Pandas文件读取 1.pandas数据读取 1.读取纯文本文件 1.1 读取csv,使用默认的标题行.逗号分隔符 1.2 读取txt文件,自己指定分隔符.列名 2.读取excel文件 ...

  9. Pandas API参考

    本文为谷歌翻译,有些翻译不准确 请参考原文档 http://pandas.pydata.org/pandas-docs/stable/api.html 此页面概述了所有公共pandas对象,函数和方法 ...

最新文章

  1. JAVA游戏编程之二----j2me MIDlet 手机游戏入门开发--贪吃蛇
  2. 故事篇:数据库架构演变之路
  3. IE内嵌google chrome frame解决浏览器兼容问题
  4. 数学计算机培训反思,小学数学信息技术反思日志5
  5. LintCode MySQL 1932/1933. 挂科最多的同学 I / II
  6. 局域网内两台9303 的管理vlan mac地址冲突解决方法
  7. .NET回归 HTML----超文本标记语言(暂时无图)
  8. Hub与Switch的帧的广播细节
  9. Win7升Windows10有获取通知,但是就不推送的解决方法
  10. 相机拍照功能之权限和Android版本问题
  11. 那些有实力进入 BAT 的本科生,都做对了什么事?
  12. linux 百科百科手机版,百科题库app-百科题库知识集锦手机版下载v2.0.0-Linux公社...
  13. SVN客户端的安装与使用----第一章
  14. c语言 fft程序,FFT算法的使用说明与C语言版实现源码
  15. 飞思卡尔单片机DZ60---时钟初始化
  16. Facebook更名Meta,扎克伯格押注元宇宙
  17. 2.reflect.TypeOf()
  18. echarts异步数据加载(在下拉框选择事件中异步更新数据)
  19. 超声波水下通讯测试接收发射信号的完整性
  20. Android面试必过——Android常见的问题

热门文章

  1. carla学习笔记(三)
  2. github-一个视频图片下载工具-lux(annie)
  3. 股票短线详解,股票短线的方法技巧?
  4. 前端性能优化(一)性能优化的指标和工具
  5. Javafx顶级容器Stage(舞台)
  6. WPF教程(一)---创建一个WPF程序基础知识
  7. 企业为什么需要知识管理
  8. 慎用小封装大阻值电阻
  9. 图解LeetCode——768. 最多能完成排序的块 II(难度:困难)
  10. SpringBoot2.0Web开发实例(一)WebJars静态资源