pandas查看数据

下列代码说明如何查看 DataFrame 头部和尾部数据：

In [13]: df.head()
Out[13]: A         B         C         D
2013-01-01  0.469112 -0.282863 -1.509059 -1.135632
2013-01-02  1.212112 -0.173215  0.119209 -1.044236
2013-01-03 -0.861849 -2.104569 -0.494929  1.071804
2013-01-04  0.721555 -0.706771 -1.039575  0.271860
2013-01-05 -0.424972  0.567020  0.276232 -1.087401In [14]: df.tail(3)
Out[14]: A         B         C         D
2013-01-04  0.721555 -0.706771 -1.039575  0.271860
2013-01-05 -0.424972  0.567020  0.276232 -1.087401
2013-01-06 -0.673690  0.113648 -1.478427  0.524988

显示索引与列名：

In [15]: df.index
Out[15]:
DatetimeIndex(['2013-01-01', '2013-01-02', '2013-01-03', '2013-01-04','2013-01-05', '2013-01-06'],dtype='datetime64[ns]', freq='D')In [16]: df.columns
Out[16]: Index(['A', 'B', 'C', 'D'], dtype='object')

DataFrame.to_numpy() (opens new window)输出底层数据的 NumPy 对象。注意，DataFrame (opens new window)的列由多种数据类型组成时，该操作耗费系统资源较大，这也是 Pandas 和 NumPy 的本质区别：NumPy 数组只有一种数据类型，DataFrame 每列的数据类型各不相同。调用 DataFrame.to_numpy() (opens new window)时，Pandas 查找支持 DataFrame 里所有数据类型的 NumPy 数据类型。还有一种数据类型是 object，可以把 DataFrame 列里的值强制转换为 Python 对象。

下面的 `df` 这个 DataFrame (opens new window)里的值都是浮点数，DataFrame.to_numpy() (opens new window)的操作会很快，而且不复制数据。

In [17]: df.to_numpy()
Out[17]:
array([[ 0.4691, -0.2829, -1.5091, -1.1356],[ 1.2121, -0.1732,  0.1192, -1.0442],[-0.8618, -2.1046, -0.4949,  1.0718],[ 0.7216, -0.7068, -1.0396,  0.2719],[-0.425 ,  0.567 ,  0.2762, -1.0874],[-0.6737,  0.1136, -1.4784,  0.525 ]])

`df2` 这个 DataFrame (opens new window)包含了多种类型，DataFrame.to_numpy() (opens new window)操作就会耗费较多资源。

In [18]: df2.to_numpy()
Out[18]:
array([[1.0, Timestamp('2013-01-02 00:00:00'), 1.0, 3, 'test', 'foo'],[1.0, Timestamp('2013-01-02 00:00:00'), 1.0, 3, 'train', 'foo'],[1.0, Timestamp('2013-01-02 00:00:00'), 1.0, 3, 'test', 'foo'],[1.0, Timestamp('2013-01-02 00:00:00'), 1.0, 3, 'train', 'foo']], dtype=object)

提醒：DataFrame.to_numpy() (opens new window)的输出不包含行索引和列标签。

describe()可以快速查看数据的统计摘要：

In [19]: df.describe()
Out[19]: A         B         C         D
count  6.000000  6.000000  6.000000  6.000000
mean   0.073711 -0.431125 -0.687758 -0.233103
std    0.843157  0.922818  0.779887  0.973118
min   -0.861849 -2.104569 -1.509059 -1.135632
25%   -0.611510 -0.600794 -1.368714 -1.076610
50%    0.022070 -0.228039 -0.767252 -0.386188
75%    0.658444  0.041933 -0.034326  0.461706
max    1.212112  0.567020  0.276232  1.071804

转置数据：

In [20]: df.T
Out[20]: 2013-01-01  2013-01-02  2013-01-03  2013-01-04  2013-01-05  2013-01-06
A    0.469112    1.212112   -0.861849    0.721555   -0.424972   -0.673690
B   -0.282863   -0.173215   -2.104569   -0.706771    0.567020    0.113648
C   -1.509059    0.119209   -0.494929   -1.039575    0.276232   -1.478427
D   -1.135632   -1.044236    1.071804    0.271860   -1.087401    0.524988

按轴排序：

In [21]: df.sort_index(axis=1, ascending=False)
Out[21]: D         C         B         A
2013-01-01 -1.135632 -1.509059 -0.282863  0.469112
2013-01-02 -1.044236  0.119209 -0.173215  1.212112
2013-01-03  1.071804 -0.494929 -2.104569 -0.861849
2013-01-04  0.271860 -1.039575 -0.706771  0.721555
2013-01-05 -1.087401  0.276232  0.567020 -0.424972
2013-01-06  0.524988 -1.478427  0.113648 -0.673690

按值排序：

In [22]: df.sort_values(by='B')
Out[22]: A         B         C         D
2013-01-03 -0.861849 -2.104569 -0.494929  1.071804
2013-01-04  0.721555 -0.706771 -1.039575  0.271860
2013-01-01  0.469112 -0.282863 -1.509059 -1.135632
2013-01-02  1.212112 -0.173215  0.119209 -1.044236
2013-01-06 -0.673690  0.113648 -1.478427  0.524988
2013-01-05 -0.424972  0.567020  0.276232 -1.087401

pandas查看数据相关推荐

pandas 查看数据的基本信息 DataFrame 篇
我的<Pandas & Excel 办公自动化>视频教程以下的 df 表示 DataFrame 对象 df.info(): # 打印摘要 df.describe(): # 描述性 ...
python pandas 查看数据的前五行
版本py3.6 查看约会网站数据集的前五行:
python使用sklearn中的make_classification函数生成分类模型（classification）需要的仿真数据、使用pandas查看生成数据的特征数据、目标数据
python使用sklearn中的make_classification函数生成分类模型(classification)需要的仿真数据.使用pandas查看生成数据的特征数据(features).目标 ...
07 ，pandas 基本使用：读数据，查看数据，数据类型，帮助，列名，数据形状，数据计算,，填补空缺值
1 ,pandas 主要作用 : 数据分析 2 ,pandas 底层 : numpy 3 ,数据准备 : 食品信息文件名 : food_info.csv 文件地址 : 链接:https://pan. ...
pandas 合并数据
pandas合并数据使用append.比如我要合并一个文件夹里面所有的pic类型的数据,那么先使用os模块,把所有文件名列出, 然后依次读入成dataframe, 读入过程中,一次append, 如果 ...
python数据预处理案例_对pandas进行数据预处理的实例讲解
参加kaggle数据挖掘比赛,就第一个赛题Titanic的数据,学习相关数据预处理以及模型建立,本博客关注基于pandas进行数据预处理过程.包括数据统计.数据离散化.数据关联性分析引入包和加载数据 ...
juypter 不省略_常用pandas清洗数据命令
pandas清洗数据常用命令安装jupyter notebook 启动 jupyter 可以在里面编写python代码 1. 加载数据df = pd.DataFrame(pd.read_excel( ...
使用Seaborn和Pandas进行数据可视化
目录介绍在Python中比较可视化库 Seaborn可视化类型创建一个条形图创建Seaborn折线图评论有了我们的数据集之后,我们将快速查看可以使用流行的Python库从数据集轻松创建可视 ...
pandas 不要编号加一行_文科生带你学Python|Pandas读取数据
16 2020-08 文科生带你学Python|Pandas读取数据距离上一次更新时间有点久,原因么,被大佬的代码打击到了,于是回去自闭充了一波电-- LEARN MORE 图片来自网络,如侵删向 ...

pandas查看数据

下列代码说明如何查看 DataFrame 头部和尾部数据：

显示索引与列名：

下面的 `df` 这个 DataFrame (opens new window)里的值都是浮点数，DataFrame.to_numpy() (opens new window)的操作会很快，而且不复制数据。

`df2` 这个 DataFrame (opens new window)包含了多种类型，DataFrame.to_numpy() (opens new window)操作就会耗费较多资源。

describe()可以快速查看数据的统计摘要：

转置数据：

按轴排序：

按值排序：

pandas查看数据相关推荐

最新文章

热门文章

pandas查看数据

下列代码说明如何查看 DataFrame 头部和尾部数据：

显示索引与列名：

下面的 df 这个 DataFrame (opens new window)里的值都是浮点数，DataFrame.to_numpy() (opens new window)的操作会很快，而且不复制数据。

df2 这个 DataFrame (opens new window)包含了多种类型，DataFrame.to_numpy() (opens new window)操作就会耗费较多资源。

describe()可以快速查看数据的统计摘要：

转置数据：

按轴排序：

按值排序：

pandas查看数据相关推荐

最新文章

热门文章

下面的 `df` 这个 DataFrame (opens new window)里的值都是浮点数，DataFrame.to_numpy() (opens new window)的操作会很快，而且不复制数据。

`df2` 这个 DataFrame (opens new window)包含了多种类型，DataFrame.to_numpy() (opens new window)操作就会耗费较多资源。