python学习笔记(十)

<h1 style="text-align:center">泰坦尼克数据处理与分析 </h1>![](http://www.allengao.cn/wp-content/uploads/2018/06/Titanic.jpg)```python
import pandas as pd%matplotlib inline
```#### 导入数据```python
titanic = pd.read_csv('K:/Code/jupyter-notebook/Python Study/train.csv')
```#### 快速预览```python
titanic.head()
```<div>
<style scoped>.dataframe tbody tr th:only-of-type {vertical-align: middle;}.dataframe tbody tr th {vertical-align: top;}.dataframe thead th {text-align: right;}
</style>
<table border="1" class="dataframe"><thead><tr style="text-align: right;"><th></th><th>PassengerId</th><th>Survived</th><th>Pclass</th><th>Name</th><th>Sex</th><th>Age</th><th>SibSp</th><th>Parch</th><th>Ticket</th><th>Fare</th><th>Cabin</th><th>Embarked</th></tr></thead><tbody><tr><th>0</th><td>1</td><td>0</td><td>3</td><td>Braund, Mr. Owen Harris</td><td>male</td><td>22.0</td><td>1</td><td>0</td><td>A/5 21171</td><td>7.2500</td><td>NaN</td><td>S</td></tr><tr><th>1</th><td>2</td><td>1</td><td>1</td><td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td><td>female</td><td>38.0</td><td>1</td><td>0</td><td>PC 17599</td><td>71.2833</td><td>C85</td><td>C</td></tr><tr><th>2</th><td>3</td><td>1</td><td>3</td><td>Heikkinen, Miss. Laina</td><td>female</td><td>26.0</td><td>0</td><td>0</td><td>STON/O2. 3101282</td><td>7.9250</td><td>NaN</td><td>S</td></tr><tr><th>3</th><td>4</td><td>1</td><td>1</td><td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td><td>female</td><td>35.0</td><td>1</td><td>0</td><td>113803</td><td>53.1000</td><td>C123</td><td>S</td></tr><tr><th>4</th><td>5</td><td>0</td><td>3</td><td>Allen, Mr. William Henry</td><td>male</td><td>35.0</td><td>0</td><td>0</td><td>373450</td><td>8.0500</td><td>NaN</td><td>S</td></tr></tbody>
</table>
</div>|单词|翻译|
|---|---|
|Passenger|社会阶层(1、精英;2、中层;3、船员/劳苦大众)|
|Survived|是否幸存|
|name|名字|
|sex|性别|
|age|年龄|
|sibsp|兄弟姐妹配偶个数 sibling spouse|
|parch|父母儿女个数|
|ticket|船票号|
|fare|船票价格|
|cabin|船舱|
|embarked|登船口|```python
titanic.info()
```<class 'pandas.core.frame.DataFrame'>RangeIndex: 891 entries, 0 to 890Data columns (total 12 columns):PassengerId    891 non-null int64Survived       891 non-null int64Pclass         891 non-null int64Name           891 non-null objectSex            891 non-null objectAge            714 non-null float64SibSp          891 non-null int64Parch          891 non-null int64Ticket         891 non-null objectFare           891 non-null float64Cabin          204 non-null objectEmbarked       889 non-null objectdtypes: float64(2), int64(5), object(5)memory usage: 83.6+ KB```python
# 把所有数值类型的数据做一个简单的统计
titanic.describe()
```<div>
<style scoped>.dataframe tbody tr th:only-of-type {vertical-align: middle;}.dataframe tbody tr th {vertical-align: top;}.dataframe thead th {text-align: right;}
</style>
<table border="1" class="dataframe"><thead><tr style="text-align: right;"><th></th><th>PassengerId</th><th>Survived</th><th>Pclass</th><th>Age</th><th>SibSp</th><th>Parch</th><th>Fare</th></tr></thead><tbody><tr><th>count</th><td>891.000000</td><td>891.000000</td><td>891.000000</td><td>714.000000</td><td>891.000000</td><td>891.000000</td><td>891.000000</td></tr><tr><th>mean</th><td>446.000000</td><td>0.383838</td><td>2.308642</td><td>29.699118</td><td>0.523008</td><td>0.381594</td><td>32.204208</td></tr><tr><th>std</th><td>257.353842</td><td>0.486592</td><td>0.836071</td><td>14.526497</td><td>1.102743</td><td>0.806057</td><td>49.693429</td></tr><tr><th>min</th><td>1.000000</td><td>0.000000</td><td>1.000000</td><td>0.420000</td><td>0.000000</td><td>0.000000</td><td>0.000000</td></tr><tr><th>25%</th><td>223.500000</td><td>0.000000</td><td>2.000000</td><td>20.125000</td><td>0.000000</td><td>0.000000</td><td>7.910400</td></tr><tr><th>50%</th><td>446.000000</td><td>0.000000</td><td>3.000000</td><td>28.000000</td><td>0.000000</td><td>0.000000</td><td>14.454200</td></tr><tr><th>75%</th><td>668.500000</td><td>1.000000</td><td>3.000000</td><td>38.000000</td><td>1.000000</td><td>0.000000</td><td>31.000000</td></tr><tr><th>max</th><td>891.000000</td><td>1.000000</td><td>3.000000</td><td>80.000000</td><td>8.000000</td><td>6.000000</td><td>512.329200</td></tr></tbody>
</table>
</div>```python
# isnull函数统计null值的个数
titanic.isnull().sum()
```PassengerId      0Survived         0Pclass           0Name             0Sex              0Age            177SibSp            0Parch            0Ticket           0Fare             0Cabin          687Embarked         2dtype: int64#### 处理空值```python
# 可以填充整个dataframe里面的空值,可以取消注释,试验一下
#titanic.fillna(0)
# 单独选择一列进行填充
#titanic.Age.fillna(0)# 求年龄的中位数
titanic.Age.median()#按年龄的中位数进行填充,此时返回一个新的series
# titanic.Age.fillna(titanic.Age.median())#直接填充,并不返回新的series
titanic.Age.fillna(titanic.Age.median(),inplace=True)# 在次查看Age的空值
titanic.isnull().sum()
```### 尝试从性别进行分析```python
# 做简单的汇总统计,经常用到
titanic.Sex.value_counts()
```male      577female    314Name: Sex, dtype: int64```python
# 生还者中,男女的人数
survived = titanic[titanic.Survived==1].Sex.value_counts()
``````python
# 未生还者中,男女的人数
dead = titanic[titanic.Survived==0].Sex.value_counts()
``````python
df = pd.DataFrame([survived,dead],index=['survived','dead'])
df.plot.bar()
```<matplotlib.axes._subplots.AxesSubplot at 0x1496afd27f0>![png](output_17_1.png)```python
# 绘图成功,但不是想要的效果
# 把dataframe转置一下,行列相互替换
df = df.T
df
```<div>
<style scoped>.dataframe tbody tr th:only-of-type {vertical-align: middle;}.dataframe tbody tr th {vertical-align: top;}.dataframe thead th {text-align: right;}
</style>
<table border="1" class="dataframe"><thead><tr style="text-align: right;"><th></th><th>survived</th><th>dead</th></tr></thead><tbody><tr><th>female</th><td>233</td><td>81</td></tr><tr><th>male</th><td>109</td><td>468</td></tr></tbody>
</table>
</div>```python
df.plot.bar() # df.plot(kind='bar')等价的
```<matplotlib.axes._subplots.AxesSubplot at 0x1496d1d7940>![png](output_19_1.png)```python
# 仍然不是我们想要的结果
df.plot(kind = 'bar',stacked = True)
```<matplotlib.axes._subplots.AxesSubplot at 0x1496d22aef0>![png](output_20_1.png)```python
# 男女中生还者的比例情况
df['p_survived'] = df.survived / (df.survived + df.dead)
df['p_dead'] = df.dead / (df.survived + df.dead)
df[['p_survived','p_dead']].plot.bar(stacked=True)
```<matplotlib.axes._subplots.AxesSubplot at 0x1496d2b7470>![png](output_21_1.png)#### 通过上面图片可以看出:性别特征对是否生还的影响还是挺大的### 尝试从年龄进行分析```python
# 简单统计
# titanic.Age.value_counts()
``````python
survived = titanic[titanic.Survived==1].Age
dead = titanic[titanic.Survived==0].Age
df =pd.DataFrame([survived,dead],index=['survived','dead'])
df = df.T
df.plot.hist(stacked=True)
```<matplotlib.axes._subplots.AxesSubplot at 0x1496d3c4be0>![png](output_25_1.png)```python
# 直方图柱子显示多一点
df.plot.hist(stacked = True,bins = 30)
# 中间很高的柱子,是因为我们把空值都替换为了中位数
```<matplotlib.axes._subplots.AxesSubplot at 0x1496e42f588>![png](output_26_1.png)```python
# 密度图,更直观一点
df.plot.kde()
```<matplotlib.axes._subplots.AxesSubplot at 0x1496e4c7dd8>![png](output_27_1.png)```python
# 可以查看年龄的分布,来决定图片横轴的取值范围
titanic.Age.describe()
```count    891.000000mean      29.361582std       13.019697min        0.42000025%       22.00000050%       28.00000075%       35.000000max       80.000000Name: Age, dtype: float64```python
# 限定范围
df.plot.kde(xlim=(0,80))
```<matplotlib.axes._subplots.AxesSubplot at 0x1496e511c18>![png](output_29_1.png)```python
age = 16
young = titanic[titanic.Age<=age]['Survived'].value_counts()
old = titanic[titanic.Age>age]['Survived'].value_counts()
df = pd.DataFrame([young,old],index = ['young','old'])
df.columns = ['dead','survived']
df.plot.bar(stacked = True)
```<matplotlib.axes._subplots.AxesSubplot at 0x1496f3a3b70>![png](output_30_1.png)```python
# 大于16岁和小于等于16岁中生还者的比例情况
df['p_survived'] = df.survived / (df.survived + df.dead)
df['p_dead'] = df.dead / (df.survived + df.dead)
df[['p_survived','p_dead']].plot.bar(stacked=True)
```<matplotlib.axes._subplots.AxesSubplot at 0x1496f407c50>![png](output_31_1.png)### 分析票价```python
# 票价和年龄特征相似
survived = titanic[titanic.Survived==1].Fare
dead = titanic[titanic.Survived==0].Fare
df = pd.DataFrame([survived,dead],index = ['survived','dead'])
df = df.T
df.plot.kde()
```<matplotlib.axes._subplots.AxesSubplot at 0x1496f47b978>![png](output_33_1.png)```python
# 设定xlim范围,先查看票价的范围
titanic.Fare.describe()
```count    891.000000mean      32.204208std       49.693429min        0.00000025%        7.91040050%       14.45420075%       31.000000max      512.329200Name: Fare, dtype: float64```python
df.plot(kind = 'kde',xlim = (0,513))
```<matplotlib.axes._subplots.AxesSubplot at 0x1496f45bba8>![png](output_35_1.png)#### 可以看出低票价的人生还率比较低### 组合特征```python
# 比如同时查看年龄和票价对生还率的影响
import matplotlib.pyplot as pltplt.scatter(titanic[titanic.Survived==0].Age, titanic[titanic.Survived==0].Fare)
```<matplotlib.collections.PathCollection at 0x1496f597a58>![png](output_38_1.png)```python
# 不美观
ax = plt.subplot()# 未生还者
age = titanic[titanic.Survived==0].Age
fare = titanic[titanic.Survived==0].Fare
plt.scatter(age, fare,s=20,alpha=0.3,linewidths=1,edgecolors='gray')#生还者
age = titanic[titanic.Survived==1].Age
fare = titanic[titanic.Survived==1].Fare
plt.scatter(age, fare,s=20,alpha=0.3,linewidths=1,edgecolors='red')
ax.set_xlabel('age')
ax.set_ylabel('fare')
```Text(0,0.5,'fare')![png](output_39_1.png)```python
# 生还者
ax = plt.subplot()
age = titanic[titanic.Survived==1].Age
fare = titanic[titanic.Survived==1].Fare
plt.scatter(age, fare,s=20,alpha=0.5,linewidths=1,edgecolors='red')
ax.set_xlabel('age')
ax.set_ylabel('fare')
```Text(0,0.5,'fare')![png](output_40_1.png)### 隐含特征```python
#提取称呼Mr Mrs Miss
titanic.Name
```0                                Braund, Mr. Owen Harris1      Cumings, Mrs. John Bradley (Florence Briggs Th...2                                 Heikkinen, Miss. Laina3           Futrelle, Mrs. Jacques Heath (Lily May Peel)4                               Allen, Mr. William Henry5                                       Moran, Mr. James6                                McCarthy, Mr. Timothy J7                         Palsson, Master. Gosta Leonard8      Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg)9                    Nasser, Mrs. Nicholas (Adele Achem)10                       Sandstrom, Miss. Marguerite Rut11                              Bonnell, Miss. Elizabeth12                        Saundercock, Mr. William Henry13                           Andersson, Mr. Anders Johan14                  Vestrom, Miss. Hulda Amanda Adolfina15                      Hewlett, Mrs. (Mary D Kingcome) 16                                  Rice, Master. Eugene17                          Williams, Mr. Charles Eugene18     Vander Planke, Mrs. Julius (Emelia Maria Vande...19                               Masselmani, Mrs. Fatima20                                  Fynney, Mr. Joseph J21                                 Beesley, Mr. Lawrence22                           McGowan, Miss. Anna "Annie"23                          Sloper, Mr. William Thompson24                         Palsson, Miss. Torborg Danira25     Asplund, Mrs. Carl Oscar (Selma Augusta Emilia...26                               Emir, Mr. Farred Chehab27                        Fortune, Mr. Charles Alexander28                         O'Dwyer, Miss. Ellen "Nellie"29                                   Todoroff, Mr. Lalio...                        861                          Giles, Mr. Frederick Edward862    Swift, Mrs. Frederick Joel (Margaret Welles Ba...863                    Sage, Miss. Dorothy Edith "Dolly"864                               Gill, Mr. John William865                             Bystrom, Mrs. (Karolina)866                         Duran y More, Miss. Asuncion867                 Roebling, Mr. Washington Augustus II868                          van Melkebeke, Mr. Philemon869                      Johnson, Master. Harold Theodor870                                    Balkic, Mr. Cerin871     Beckwith, Mrs. Richard Leonard (Sallie Monypeny)872                             Carlsson, Mr. Frans Olof873                          Vander Cruyssen, Mr. Victor874                Abelson, Mrs. Samuel (Hannah Wizosky)875                     Najib, Miss. Adele Kiamie "Jane"876                        Gustafsson, Mr. Alfred Ossian877                                 Petroff, Mr. Nedelio878                                   Laleff, Mr. Kristo879        Potter, Mrs. Thomas Jr (Lily Alexenia Wilson)880         Shelley, Mrs. William (Imanita Parrish Hall)881                                   Markun, Mr. Johann882                         Dahlberg, Miss. Gerda Ulrika883                        Banfield, Mr. Frederick James884                               Sutehall, Mr. Henry Jr885                 Rice, Mrs. William (Margaret Norton)886                                Montvila, Rev. Juozas887                         Graham, Miss. Margaret Edith888             Johnston, Miss. Catherine Helen "Carrie"889                                Behr, Mr. Karl Howell890                                  Dooley, Mr. PatrickName: Name, Length: 891, dtype: object```python
titanic['title'] = titanic.Name.apply(lambda name: name.split(',')[1].split('.')[0].strip())
``````python
s= 'Williams, Mr.Howard Hugh "harry"'
s.split(',')[-1].split('.')[0].strip()
```'Mr'```python
titanic.title.value_counts()
# 比如有一个人称呼是Mr,而年龄是不可知的,这个时候可以用所有Mr的年龄平均值来替代,
# 而不是用我们之前最简单的所有数据的中位数。
```Mr              517Miss            182Mrs             125Master           40Dr                7Rev               6Mlle              2Major             2Col               2Capt              1Ms                1Mme               1Jonkheer          1the Countess      1Don               1Lady              1Sir               1Name: title, dtype: int64### GDP```python
### 夜光图,简单用灯光图的亮度来模拟这个GDP
``````python
titanic.head()
```<div>
<style scoped>.dataframe tbody tr th:only-of-type {vertical-align: middle;}.dataframe tbody tr th {vertical-align: top;}.dataframe thead th {text-align: right;}
</style>
<table border="1" class="dataframe"><thead><tr style="text-align: right;"><th></th><th>PassengerId</th><th>Survived</th><th>Pclass</th><th>Name</th><th>Sex</th><th>Age</th><th>SibSp</th><th>Parch</th><th>Ticket</th><th>Fare</th><th>Cabin</th><th>Embarked</th><th>title</th></tr></thead><tbody><tr><th>0</th><td>1</td><td>0</td><td>3</td><td>Braund, Mr. Owen Harris</td><td>male</td><td>22.0</td><td>1</td><td>0</td><td>A/5 21171</td><td>7.2500</td><td>NaN</td><td>S</td><td>Mr</td></tr><tr><th>1</th><td>2</td><td>1</td><td>1</td><td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td><td>female</td><td>38.0</td><td>1</td><td>0</td><td>PC 17599</td><td>71.2833</td><td>C85</td><td>C</td><td>Mrs</td></tr><tr><th>2</th><td>3</td><td>1</td><td>3</td><td>Heikkinen, Miss. Laina</td><td>female</td><td>26.0</td><td>0</td><td>0</td><td>STON/O2. 3101282</td><td>7.9250</td><td>NaN</td><td>S</td><td>Miss</td></tr><tr><th>3</th><td>4</td><td>1</td><td>1</td><td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td><td>female</td><td>35.0</td><td>1</td><td>0</td><td>113803</td><td>53.1000</td><td>C123</td><td>S</td><td>Mrs</td></tr><tr><th>4</th><td>5</td><td>0</td><td>3</td><td>Allen, Mr. William Henry</td><td>male</td><td>35.0</td><td>0</td><td>0</td><td>373450</td><td>8.0500</td><td>NaN</td><td>S</td><td>Mr</td></tr></tbody>
</table>
</div>```python
titanic['family_size'] = titanic.SibSp + titanic.Parch + 1
``````python
titanic
```<div>
<style scoped>.dataframe tbody tr th:only-of-type {vertical-align: middle;}.dataframe tbody tr th {vertical-align: top;}.dataframe thead th {text-align: right;}
</style>
<table border="1" class="dataframe"><thead><tr style="text-align: right;"><th></th><th>PassengerId</th><th>Survived</th><th>Pclass</th><th>Name</th><th>Sex</th><th>Age</th><th>SibSp</th><th>Parch</th><th>Ticket</th><th>Fare</th><th>Cabin</th><th>Embarked</th><th>title</th><th>family_size</th></tr></thead><tbody><tr><th>0</th><td>1</td><td>0</td><td>3</td><td>Braund, Mr. Owen Harris</td><td>male</td><td>22.0</td><td>1</td><td>0</td><td>A/5 21171</td><td>7.2500</td><td>NaN</td><td>S</td><td>Mr</td><td>2</td></tr><tr><th>1</th><td>2</td><td>1</td><td>1</td><td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td><td>female</td><td>38.0</td><td>1</td><td>0</td><td>PC 17599</td><td>71.2833</td><td>C85</td><td>C</td><td>Mrs</td><td>2</td></tr><tr><th>2</th><td>3</td><td>1</td><td>3</td><td>Heikkinen, Miss. Laina</td><td>female</td><td>26.0</td><td>0</td><td>0</td><td>STON/O2. 3101282</td><td>7.9250</td><td>NaN</td><td>S</td><td>Miss</td><td>1</td></tr><tr><th>3</th><td>4</td><td>1</td><td>1</td><td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td><td>female</td><td>35.0</td><td>1</td><td>0</td><td>113803</td><td>53.1000</td><td>C123</td><td>S</td><td>Mrs</td><td>2</td></tr><tr><th>4</th><td>5</td><td>0</td><td>3</td><td>Allen, Mr. William Henry</td><td>male</td><td>35.0</td><td>0</td><td>0</td><td>373450</td><td>8.0500</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>5</th><td>6</td><td>0</td><td>3</td><td>Moran, Mr. James</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>330877</td><td>8.4583</td><td>NaN</td><td>Q</td><td>Mr</td><td>1</td></tr><tr><th>6</th><td>7</td><td>0</td><td>1</td><td>McCarthy, Mr. Timothy J</td><td>male</td><td>54.0</td><td>0</td><td>0</td><td>17463</td><td>51.8625</td><td>E46</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>7</th><td>8</td><td>0</td><td>3</td><td>Palsson, Master. Gosta Leonard</td><td>male</td><td>2.0</td><td>3</td><td>1</td><td>349909</td><td>21.0750</td><td>NaN</td><td>S</td><td>Master</td><td>5</td></tr><tr><th>8</th><td>9</td><td>1</td><td>3</td><td>Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg)</td><td>female</td><td>27.0</td><td>0</td><td>2</td><td>347742</td><td>11.1333</td><td>NaN</td><td>S</td><td>Mrs</td><td>3</td></tr><tr><th>9</th><td>10</td><td>1</td><td>2</td><td>Nasser, Mrs. Nicholas (Adele Achem)</td><td>female</td><td>14.0</td><td>1</td><td>0</td><td>237736</td><td>30.0708</td><td>NaN</td><td>C</td><td>Mrs</td><td>2</td></tr><tr><th>10</th><td>11</td><td>1</td><td>3</td><td>Sandstrom, Miss. Marguerite Rut</td><td>female</td><td>4.0</td><td>1</td><td>1</td><td>PP 9549</td><td>16.7000</td><td>G6</td><td>S</td><td>Miss</td><td>3</td></tr><tr><th>11</th><td>12</td><td>1</td><td>1</td><td>Bonnell, Miss. Elizabeth</td><td>female</td><td>58.0</td><td>0</td><td>0</td><td>113783</td><td>26.5500</td><td>C103</td><td>S</td><td>Miss</td><td>1</td></tr><tr><th>12</th><td>13</td><td>0</td><td>3</td><td>Saundercock, Mr. William Henry</td><td>male</td><td>20.0</td><td>0</td><td>0</td><td>A/5. 2151</td><td>8.0500</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>13</th><td>14</td><td>0</td><td>3</td><td>Andersson, Mr. Anders Johan</td><td>male</td><td>39.0</td><td>1</td><td>5</td><td>347082</td><td>31.2750</td><td>NaN</td><td>S</td><td>Mr</td><td>7</td></tr><tr><th>14</th><td>15</td><td>0</td><td>3</td><td>Vestrom, Miss. Hulda Amanda Adolfina</td><td>female</td><td>14.0</td><td>0</td><td>0</td><td>350406</td><td>7.8542</td><td>NaN</td><td>S</td><td>Miss</td><td>1</td></tr><tr><th>15</th><td>16</td><td>1</td><td>2</td><td>Hewlett, Mrs. (Mary D Kingcome)</td><td>female</td><td>55.0</td><td>0</td><td>0</td><td>248706</td><td>16.0000</td><td>NaN</td><td>S</td><td>Mrs</td><td>1</td></tr><tr><th>16</th><td>17</td><td>0</td><td>3</td><td>Rice, Master. Eugene</td><td>male</td><td>2.0</td><td>4</td><td>1</td><td>382652</td><td>29.1250</td><td>NaN</td><td>Q</td><td>Master</td><td>6</td></tr><tr><th>17</th><td>18</td><td>1</td><td>2</td><td>Williams, Mr. Charles Eugene</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>244373</td><td>13.0000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>18</th><td>19</td><td>0</td><td>3</td><td>Vander Planke, Mrs. Julius (Emelia Maria Vande...</td><td>female</td><td>31.0</td><td>1</td><td>0</td><td>345763</td><td>18.0000</td><td>NaN</td><td>S</td><td>Mrs</td><td>2</td></tr><tr><th>19</th><td>20</td><td>1</td><td>3</td><td>Masselmani, Mrs. Fatima</td><td>female</td><td>28.0</td><td>0</td><td>0</td><td>2649</td><td>7.2250</td><td>NaN</td><td>C</td><td>Mrs</td><td>1</td></tr><tr><th>20</th><td>21</td><td>0</td><td>2</td><td>Fynney, Mr. Joseph J</td><td>male</td><td>35.0</td><td>0</td><td>0</td><td>239865</td><td>26.0000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>21</th><td>22</td><td>1</td><td>2</td><td>Beesley, Mr. Lawrence</td><td>male</td><td>34.0</td><td>0</td><td>0</td><td>248698</td><td>13.0000</td><td>D56</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>22</th><td>23</td><td>1</td><td>3</td><td>McGowan, Miss. Anna "Annie"</td><td>female</td><td>15.0</td><td>0</td><td>0</td><td>330923</td><td>8.0292</td><td>NaN</td><td>Q</td><td>Miss</td><td>1</td></tr><tr><th>23</th><td>24</td><td>1</td><td>1</td><td>Sloper, Mr. William Thompson</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>113788</td><td>35.5000</td><td>A6</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>24</th><td>25</td><td>0</td><td>3</td><td>Palsson, Miss. Torborg Danira</td><td>female</td><td>8.0</td><td>3</td><td>1</td><td>349909</td><td>21.0750</td><td>NaN</td><td>S</td><td>Miss</td><td>5</td></tr><tr><th>25</th><td>26</td><td>1</td><td>3</td><td>Asplund, Mrs. Carl Oscar (Selma Augusta Emilia...</td><td>female</td><td>38.0</td><td>1</td><td>5</td><td>347077</td><td>31.3875</td><td>NaN</td><td>S</td><td>Mrs</td><td>7</td></tr><tr><th>26</th><td>27</td><td>0</td><td>3</td><td>Emir, Mr. Farred Chehab</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>2631</td><td>7.2250</td><td>NaN</td><td>C</td><td>Mr</td><td>1</td></tr><tr><th>27</th><td>28</td><td>0</td><td>1</td><td>Fortune, Mr. Charles Alexander</td><td>male</td><td>19.0</td><td>3</td><td>2</td><td>19950</td><td>263.0000</td><td>C23 C25 C27</td><td>S</td><td>Mr</td><td>6</td></tr><tr><th>28</th><td>29</td><td>1</td><td>3</td><td>O'Dwyer, Miss. Ellen "Nellie"</td><td>female</td><td>28.0</td><td>0</td><td>0</td><td>330959</td><td>7.8792</td><td>NaN</td><td>Q</td><td>Miss</td><td>1</td></tr><tr><th>29</th><td>30</td><td>0</td><td>3</td><td>Todoroff, Mr. Lalio</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>349216</td><td>7.8958</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td></tr><tr><th>861</th><td>862</td><td>0</td><td>2</td><td>Giles, Mr. Frederick Edward</td><td>male</td><td>21.0</td><td>1</td><td>0</td><td>28134</td><td>11.5000</td><td>NaN</td><td>S</td><td>Mr</td><td>2</td></tr><tr><th>862</th><td>863</td><td>1</td><td>1</td><td>Swift, Mrs. Frederick Joel (Margaret Welles Ba...</td><td>female</td><td>48.0</td><td>0</td><td>0</td><td>17466</td><td>25.9292</td><td>D17</td><td>S</td><td>Mrs</td><td>1</td></tr><tr><th>863</th><td>864</td><td>0</td><td>3</td><td>Sage, Miss. Dorothy Edith "Dolly"</td><td>female</td><td>28.0</td><td>8</td><td>2</td><td>CA. 2343</td><td>69.5500</td><td>NaN</td><td>S</td><td>Miss</td><td>11</td></tr><tr><th>864</th><td>865</td><td>0</td><td>2</td><td>Gill, Mr. John William</td><td>male</td><td>24.0</td><td>0</td><td>0</td><td>233866</td><td>13.0000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>865</th><td>866</td><td>1</td><td>2</td><td>Bystrom, Mrs. (Karolina)</td><td>female</td><td>42.0</td><td>0</td><td>0</td><td>236852</td><td>13.0000</td><td>NaN</td><td>S</td><td>Mrs</td><td>1</td></tr><tr><th>866</th><td>867</td><td>1</td><td>2</td><td>Duran y More, Miss. Asuncion</td><td>female</td><td>27.0</td><td>1</td><td>0</td><td>SC/PARIS 2149</td><td>13.8583</td><td>NaN</td><td>C</td><td>Miss</td><td>2</td></tr><tr><th>867</th><td>868</td><td>0</td><td>1</td><td>Roebling, Mr. Washington Augustus II</td><td>male</td><td>31.0</td><td>0</td><td>0</td><td>PC 17590</td><td>50.4958</td><td>A24</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>868</th><td>869</td><td>0</td><td>3</td><td>van Melkebeke, Mr. Philemon</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>345777</td><td>9.5000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>869</th><td>870</td><td>1</td><td>3</td><td>Johnson, Master. Harold Theodor</td><td>male</td><td>4.0</td><td>1</td><td>1</td><td>347742</td><td>11.1333</td><td>NaN</td><td>S</td><td>Master</td><td>3</td></tr><tr><th>870</th><td>871</td><td>0</td><td>3</td><td>Balkic, Mr. Cerin</td><td>male</td><td>26.0</td><td>0</td><td>0</td><td>349248</td><td>7.8958</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>871</th><td>872</td><td>1</td><td>1</td><td>Beckwith, Mrs. Richard Leonard (Sallie Monypeny)</td><td>female</td><td>47.0</td><td>1</td><td>1</td><td>11751</td><td>52.5542</td><td>D35</td><td>S</td><td>Mrs</td><td>3</td></tr><tr><th>872</th><td>873</td><td>0</td><td>1</td><td>Carlsson, Mr. Frans Olof</td><td>male</td><td>33.0</td><td>0</td><td>0</td><td>695</td><td>5.0000</td><td>B51 B53 B55</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>873</th><td>874</td><td>0</td><td>3</td><td>Vander Cruyssen, Mr. Victor</td><td>male</td><td>47.0</td><td>0</td><td>0</td><td>345765</td><td>9.0000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>874</th><td>875</td><td>1</td><td>2</td><td>Abelson, Mrs. Samuel (Hannah Wizosky)</td><td>female</td><td>28.0</td><td>1</td><td>0</td><td>P/PP 3381</td><td>24.0000</td><td>NaN</td><td>C</td><td>Mrs</td><td>2</td></tr><tr><th>875</th><td>876</td><td>1</td><td>3</td><td>Najib, Miss. Adele Kiamie "Jane"</td><td>female</td><td>15.0</td><td>0</td><td>0</td><td>2667</td><td>7.2250</td><td>NaN</td><td>C</td><td>Miss</td><td>1</td></tr><tr><th>876</th><td>877</td><td>0</td><td>3</td><td>Gustafsson, Mr. Alfred Ossian</td><td>male</td><td>20.0</td><td>0</td><td>0</td><td>7534</td><td>9.8458</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>877</th><td>878</td><td>0</td><td>3</td><td>Petroff, Mr. Nedelio</td><td>male</td><td>19.0</td><td>0</td><td>0</td><td>349212</td><td>7.8958</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>878</th><td>879</td><td>0</td><td>3</td><td>Laleff, Mr. Kristo</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>349217</td><td>7.8958</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>879</th><td>880</td><td>1</td><td>1</td><td>Potter, Mrs. Thomas Jr (Lily Alexenia Wilson)</td><td>female</td><td>56.0</td><td>0</td><td>1</td><td>11767</td><td>83.1583</td><td>C50</td><td>C</td><td>Mrs</td><td>2</td></tr><tr><th>880</th><td>881</td><td>1</td><td>2</td><td>Shelley, Mrs. William (Imanita Parrish Hall)</td><td>female</td><td>25.0</td><td>0</td><td>1</td><td>230433</td><td>26.0000</td><td>NaN</td><td>S</td><td>Mrs</td><td>2</td></tr><tr><th>881</th><td>882</td><td>0</td><td>3</td><td>Markun, Mr. Johann</td><td>male</td><td>33.0</td><td>0</td><td>0</td><td>349257</td><td>7.8958</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>882</th><td>883</td><td>0</td><td>3</td><td>Dahlberg, Miss. Gerda Ulrika</td><td>female</td><td>22.0</td><td>0</td><td>0</td><td>7552</td><td>10.5167</td><td>NaN</td><td>S</td><td>Miss</td><td>1</td></tr><tr><th>883</th><td>884</td><td>0</td><td>2</td><td>Banfield, Mr. Frederick James</td><td>male</td><td>28.0</td><td>0</td><td>0</td><td>C.A./SOTON 34068</td><td>10.5000</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>884</th><td>885</td><td>0</td><td>3</td><td>Sutehall, Mr. Henry Jr</td><td>male</td><td>25.0</td><td>0</td><td>0</td><td>SOTON/OQ 392076</td><td>7.0500</td><td>NaN</td><td>S</td><td>Mr</td><td>1</td></tr><tr><th>885</th><td>886</td><td>0</td><td>3</td><td>Rice, Mrs. William (Margaret Norton)</td><td>female</td><td>39.0</td><td>0</td><td>5</td><td>382652</td><td>29.1250</td><td>NaN</td><td>Q</td><td>Mrs</td><td>6</td></tr><tr><th>886</th><td>887</td><td>0</td><td>2</td><td>Montvila, Rev. Juozas</td><td>male</td><td>27.0</td><td>0</td><td>0</td><td>211536</td><td>13.0000</td><td>NaN</td><td>S</td><td>Rev</td><td>1</td></tr><tr><th>887</th><td>888</td><td>1</td><td>1</td><td>Graham, Miss. Margaret Edith</td><td>female</td><td>19.0</td><td>0</td><td>0</td><td>112053</td><td>30.0000</td><td>B42</td><td>S</td><td>Miss</td><td>1</td></tr><tr><th>888</th><td>889</td><td>0</td><td>3</td><td>Johnston, Miss. Catherine Helen "Carrie"</td><td>female</td><td>28.0</td><td>1</td><td>2</td><td>W./C. 6607</td><td>23.4500</td><td>NaN</td><td>S</td><td>Miss</td><td>4</td></tr><tr><th>889</th><td>890</td><td>1</td><td>1</td><td>Behr, Mr. Karl Howell</td><td>male</td><td>26.0</td><td>0</td><td>0</td><td>111369</td><td>30.0000</td><td>C148</td><td>C</td><td>Mr</td><td>1</td></tr><tr><th>890</th><td>891</td><td>0</td><td>3</td><td>Dooley, Mr. Patrick</td><td>male</td><td>32.0</td><td>0</td><td>0</td><td>370376</td><td>7.7500</td><td>NaN</td><td>Q</td><td>Mr</td><td>1</td></tr></tbody>
</table>
<p>891 rows × 14 columns</p>
</div>```python
titanic.family_size.value_counts()
```1     5372     1613     1024      296      225      157      1211      78       6Name: family_size, dtype: int64```python
def func(family_size):if family_size == 1:return 'Singleton'if family_size <= 4 and family_size >= 2:return 'SmallFamily'if family_size > 4:return 'LargeFamily'
titanic['family_type'] = titanic.family_size.apply(func)
``````python
titanic.family_type.value_counts()
```Singleton      537SmallFamily    292LargeFamily     62Name: family_type, dtype: int64

Python编程入门学习笔记(十)相关推荐

  1. Python快速编程入门#学习笔记01# |第一章 :Python基础知识 (Python发展历程、常见的开发工具、import模块导入)

    全文目录 ==先导知识== 1 认识Python 1.1.1 Python的发展历程 1.1.2 Python语言的特点 2. Python解释器的安装与Python程序运行 1.2.1 安装Pyth ...

  2. Python快速编程入门#学习笔记03# |第二章 :Python基础(代码格式、标识符关键字、变量和数据类型、数字类型以及运算符)

    全文目录 ==先导知识== 学习目标: 2.1 代码格式 2.1.1 注释 2.1.2 缩进 2.1.3 语句换行 2.2 标识符和关键字 2.2.1 标识符 2.2.2 关键字 2.3 变量和数据类 ...

  3. 01.Java 编程入门学习笔记20210307

    Java 编程入门学习笔记-day01 第0章:编程入门 1.计算机的概述 计算机 = 硬件 + 软件 1.1硬件:冯诺依曼体系 CPU: CPU的衡量标准:速度的计量单位是赫兹(Hz),1Hz相当于 ...

  4. 原创 OpenCV3编程入门 学习笔记(总)

    版权声明:本文为博主原创文章,遵循 CC 4.0 BY-SA 版权协议,转载请附上原文出处链接和本声明. 本文链接:https://blog.csdn.net/qq_36163358/article/ ...

  5. OpenCV3编程入门 学习笔记(总)

    OpenCV3编程入门 学习笔记 2018.12.12-2018.12.29 此博客为在看过毛星云版<OpenCV3编程入门>后所总结的一本笔记,可供复习使用. 文章目录 OpenCV3编 ...

  6. python基础入门学习笔记 (2)

    python基础入门学习笔记 2021年2月8日 1 编译器和解释器的区别 编译器/解释器:高级语言与机器之间的翻译官 2 值传递: print "a = ",a print &q ...

  7. python数据分析入门学习笔记

    python数据分析入门学习笔记儿 学习利用python进行数据分析的笔记儿&下星期二内部交流会要讲的内容,一并分享给大家.博主粗心大意,有什么不对的地方欢迎指正~还有许多尚待完善的地方,待我 ...

  8. 熊猫的python小课_朋友圈里那个可爱的小熊猫Python编程的学习笔记,学编程,不难!...

    Print( ) 详细请见公众号文章,里面有让人印象深刻,无法忘记的灵魂画手的图画解释.朋友圈里那个可爱的小熊猫Python编程的学习笔记,学编程,不难!​mp.weixin.qq.com 人类有人类 ...

  9. Opencv3编程入门学习笔记(五)之通道分离(split)与合并(merge)

    若要对Opencv中(BGR)颜色通道进行单一处理,那必然会涉及到通道分离(split)与合并(merge).那么本篇博客笔者记录了两个方法的使用方法和案例.案例来源于<Opencv3编程入门学 ...

最新文章

  1. javascript体系-DOM原理
  2. 设置静态ip上网_开始使用第一步:连上网线换个皮
  3. mysql查当前用户的的命令_mysql命令大全用户管理相关命令
  4. Win 7英文系统显示中文乱码的解决
  5. 分组框控件 1130
  6. linux设备驱动中的并发控制
  7. Ideal 使用帮助手册
  8. GitHub一夜爆火的阿里高并发技术小册究竟有什么魅力?
  9. Coursera-AndrewNg(吴恩达)机器学习笔记——第一周
  10. 集中火力,专项击破!数据分析可视化广深线下培训火热来袭
  11. 0-1之间的随机数 C/C++
  12. 利润表模板excel_Excel 最新财务报表分析模板(资产负债表、利润表、现金流量表…)...
  13. 查询计算机ip地址的方法,计算机的ip地址查询的几种简单方法介绍
  14. php saas,saas系统是什么
  15. MPMoviePlayerController 电影播放器—备用
  16. 2022android自定义文本路径选择器java教程
  17. 如何使用磁盘管理不删除数据合并分区?
  18. 通过配置光猫路由器实现家用主机远程桌面连接
  19. 南安一中八十年校庆征文 陈建春老师
  20. IDEA控制台输出彩色字体和彩色背景

热门文章

  1. 几款常用LED显示屏驱动芯片介绍及选择VK1624
  2. 今天再次认真整理了浏览器收藏夹
  3. 数据分析《令人心动的offer2》—你心动了吗?
  4. 程序人生 - 目前市场上各大品牌笔记本(戴尔,联想,华硕,惠普等等)的点评?
  5. 尼康d850相机参数测试软件,尼康(Nikon)D850 单机数码相机宽容度评测-ZOL中关村在线...
  6. 硬盘检测工具+linux,linux硬盘检测工具:Smartmontools使用指南
  7. androidx和android的区别,Android X 详解
  8. 多网卡情况,使用静态路由,为访问目标网络指定路径
  9. 上海和南京211计算机,高考分数线解析:上海院校vs外地211(二)
  10. 用swoole实现实时弹幕网站demo