python——pandas库之Series数据结构基础

文章目录

pandas之Series
- 1.Series简介
- 2.Series属性
- 3.数学运算
- 4.缺失值检测
- 6.Series自动对齐索引
- 7.name属性

pandas之Series

1.Series简介

Series是一种一维的数组型对象，它包含了一个数值和数据标签，称为索引

import pandas as pd
import numpy as np

obj = pd.Series([4, 7, -5, 3])
obj

OUT:

0    4
1    7
2   -5
3    3
dtype: int64

2.Series属性

通过valus属性和index属性分别获得Series对象的值和索引

obj.values

OUT:

array([ 4,  7, -5,  3], dtype=int64)

obj.index # 与range()相似

OUT:

RangeIndex(start=0, stop=4, step=1)

可以建立一个索引数列，用标签来标识每个数据点

obj2 = pd.Series([4, 7, -5, 3], ['a', 'b', 'c', 'd'])
obj2

OUT:

a    4
b    7
c   -5
d    3
dtype: int64

可以使用标签来进行索引

obj2['a']

OUT:

obj2[['a', 'b', 'c']]

OUT:

a    4
b    7
c   -5
dtype: int64

3.数学运算

支持数学运算和数学函数

obj2[obj2 > 0]

OUT:

a    4
b    7
d    3
dtype: int64

obj2 * 2

OUT:

a     8
b    14
c   -10
d     6
dtype: int64

np.exp(obj2)

OUT:

a      54.598150
b    1096.633158
c       0.006738
d      20.085537
dtype: float64

5.Series也可以看成是一个字典

'a' in obj2

OUT:
True

'e' in obj2

OUT:

False

可以使用字典生成Series

sdata = {'a' : 1, 'c' : 3, 'b' : 2, 'd' : 4}
obj3 = pd.Series(sdata)
obj3

OUT:

a    1
c    3
b    2
d    4
dtype: int64

自定义字典键的顺序

states = ['a', 'b', 'c', 'e']
obj4 = pd.Series(sdata, states)
obj4

OUT:

a    1.0
b    2.0
c    3.0
e    NaN
dtype: float64

在数据没有出现过的标签对应的值为NAN,因为‘d’不再states中，它被排除在结果对象外

4.缺失值检测

pandas中使用isnull和notnull来检查缺失数据

pd.isnull(obj4)

OUT:

a    False
b    False
c    False
e     True
dtype: bool

pd.notnull(obj4)

OUT:

a     True
b     True
c     True
e    False
dtype: bool

obj4.isnull()

OUT:

a    False
b    False
c    False
e     True
dtype: bool

obj4.notnull()

OUT:

a     True
b     True
c     True
e    False
dtype: bool

6.Series自动对齐索引

如果没有规定索引顺序的话，新产生的Series对象的索引时排序好的字典键。如：obj3+obj4

obj3

OUT:

a    1
c    3
b    2
d    4
dtype: int64

obj4

OUT:

a    1.0
b    2.0
c    3.0
e    NaN
dtype: float64

obj3 + obj4

OUT:

a    2.0
b    4.0
c    6.0
d    NaN
e    NaN
dtype: float64

7.name属性

Series对象自身和索引都有name属性

obj4.name = 'population'
obj4.index.name = 'state'
obj4

OUT:

state
a    1.0
b    2.0
c    3.0
e    NaN
Name: population, dtype: float64

Series对象的索引可以通过按位置赋值的方式进行改变

obj

OUT:

0    4
1    7
2   -5
3    3
dtype: int64

obj.index = ['a', 'b', 'c', 'd']
obj

OUT:

a    4
b    7
c   -5
d    3
dtype: int64

python——pandas库之Series数据结构基础相关推荐

python——pandas库之DataFrame数据结构基础
DataFrame简介 dataframe表示的是矩阵的数据表,每一列可以是不同的值类型,可以看作一个共享相同索引的Series字典.在dataframe中,数据被存储为一个以上的二维块 1.利用等长 ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（1）
对Python的 pandas 库所有的内置元类.函数.子模块等全部浏览一遍,然后挑选一些重点学习一下.我安装的库版本号为1.3.5,如下: >>> import pandas as ...
Python Pandas 的使用——Series
Python Pandas 的使用--Series Pandas是一个强大的分析结构化数据的工具集:它的使用基础是Numpy(提供高性能的矩阵运算):用于数据挖掘和数据分析,同时也提供数据清洗功能. ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（5）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(4)_Hann Yang的博客-CSDN博客 S~W: Function46~56 Types['Function'][45: ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（4）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(3)_Hann Yang的博客-CSDN博客 R(read_系列2): Function36~45 Types['Funct ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（7）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(6)_Hann Yang的博客-CSDN博客 to_系列函数:22个 (12~22) Function12 to_numpy(s ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（6）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(5)_Hann Yang的博客-CSDN博客 DataFrame 类方法(211个,其中包含18个子类.2个子模块) >& ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（3）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(2)_Hann Yang的博客-CSDN博客 R(read_系列1): Function26~35 Types['Functi ...
Python pandas库｜任凭弱水三千，我只取一瓢饮（2）
上一篇链接: Python pandas库|任凭弱水三千,我只取一瓢饮(2)_Hann Yang的博客-CSDN博客 I~Q: Function10~25 Types['Function'][9:2 ...

python——pandas库之Series数据结构基础

文章目录

pandas之Series

1.Series简介

2.Series属性

3.数学运算

4.缺失值检测

6.Series自动对齐索引

7.name属性

python——pandas库之Series数据结构基础相关推荐

最新文章

热门文章