参数
- n_components
- tol
- copy
- max_iter
- noise_variance_init
- svd_method
- iterated_power
- rotation
- random_state
属性
- components_
- loglike_
- noise_variance_
- n_iter_
- mean_
- n_features_in_
- feature_names_in_
类方法
- fit(X, y=None)
- - 参数
  - 返回值
- fit_transform(X, y=None, **fit_params)
- - 参数
  - 返回值
- set_params(**params)
- get_params(deep=True)
- transform(X)
- - 参数
  - 返回值
- score(X,y=None)
- - 返回值
- score_samples(X)
- - 参数
  - 返回值
- get_precision()
- - 返回值
- get_feature_names_out(input_features=None)
- - 返回值
- get_covariance()
- - 返回值
案例：鸢尾花数据集降维

sklearn.decomposition.FactorAnalysis(n_components=None, *, tol=0.01, copy=True, max_iter=1000,\noise_variance_init=None, svd_method='randomized', iterated_power=3, rotation=None, random_state=0

参数

n_components

int, default=None
隐性因子维度，变换后得到的X的组分数量，当设置为None时，该参数将等于特征数量

tol

float, default=1e-2
似然对数增加的停止阈值

copy

bool, default=True
是否使用原数据的复制版本进行拟合，如果设置为否，原数据将会在拟合过程中被改变

max_iter

int, default=1000
最大迭代次数

noise_variance_init

ndarray of shape (n_features,), default=None
初始化每个特征的噪声矩阵，当设置为None时，该参数将等于np.ones(n_features)

svd_method

{‘lapack’, ‘randomized’}, default=’randomized’

方法	描述
‘lapack’	standard SVD from scipy.linalg
‘randomized’	fast randomized_svd function

默认使用 ‘randomized’ ，因为在大多数应用场景下随机化将足够精确，同时提供显著的速度增益，而且精度也可以通过设置更高的迭代值来提高，当然你也可以使用 ‘lapack’以得到更高的精度

iterated_power

int, default=3
幂方法迭代次数，只有svd_method= ‘randomized’时生效

rotation

{‘varimax’, ‘quartimax’}, default=None
实现因子旋转，目前支持varimax 和 quartimax两种旋转方法

random_state

int or RandomState instance, default=0
当参数svd_method=‘randomized’时使用，传递一个int值，以在多个函数调用之间获得可重复的结果

属性

components_

ndarray of shape (n_components, n_features)
载荷矩阵

loglike_

list of shape (n_iterations,)
每次迭代的似然对数

noise_variance_

ndarray of shape (n_features,)
每个特征的噪声方差

n_iter_

int
迭代次数

mean_

ndarray of shape (n_features,)
根据训练数据集估计得到的前验均值

n_features_in_

int
拟合过程中使用到的特征数量

feature_names_in_

ndarray of shape (n_features_in_,)
拟合过程中使用到的特征名称。只当输入数据有数据类型为str的特征名称时才有该属性

类方法

fit(X, y=None)

利用基于奇异值分解的方法将因子分析模型拟合到X

参数

X
array-like of shape (n_samples, n_features)
训练数据

返回值

object
因子分析类实例

fit_transform(X, y=None, **fit_params)

Fit to data, then transform it.
拟合并转换数据。利用可选参数fit_params转换x、y，返回被转换后的x
Fits transformer to X and y with optional parameters fit_params and returns a transformed version of X.

参数

X
array-like of shape (n_samples, n_features)
输入样本

y
array-like of shape (n_samples,) or (n_samples, n_outputs), default=None
标签数据

**fit_paramsdict
额外的拟合参数
Additional fit parameters.

返回值

X_new
ndarray array of shape (n_samples, n_features_new)
转换后的数组

set_params(**params)

设置模型参数，params：类字典数据

get_params(deep=True)

返回模型参数

transform(X)

使用模型对X进行降维，计算潜在变量的期望平均值

参数

X
array-like of shape (n_samples, n_features)
训练数据

返回值

X_new
ndarray of shape (n_samples, n_components)
降维后的X

score(X,y=None)

根据模型计算输入样本的平均似然对数

返回值

float
当前模型下样本的平均似然对数。
Average log-likelihood of the samples under the current model.

score_samples(X)

计算每个样本的对数似然
Compute the log-likelihood of each sample.

参数

X
ndarray of shape (n_samples, n_features)
数据

返回值

ndarray of shape (n_samples,)

当前模型下每个样本的平均似然对数。
Log-likelihood of each sample under the current model.

get_precision()

利用因子分析模型计算数据精度矩阵。

返回值

ndarray of shape (n_features, n_features)
精度矩阵

get_feature_names_out(input_features=None)

Get output feature names for transformation.
获取转换的输出特征名称。

返回值

feature_names_out
ndarray of str objects
转换后的特征名称

get_covariance()

用因子分析模型计算数据协方差

返回值

ndarray of shape (n_features, n_features)
数据的协方差矩阵

案例：鸢尾花数据集降维

from sklearn.datasets import load_digits
from sklearn.decomposition import FactorAnalysis
x,_=load_digits(return_X_y=True)
transformer=FactorAnalysis(n_components=7)
transformed=transformer.fit_transform(x)
transformed.shape,x.shape
>>> ((1797, 7), (1797, 64))
transformer.n_features_in_
>>> 64

因子分析--FactorAnalysis类相关推荐

Python 无监督学习实用指南：6~10
原文:Hands-on unsupervised learning with Python 协议:CC BY-NC-SA 4.0 译者:飞龙本文来自[ApacheCN 深度学习译文集],采用译后编 ...
python聚类算法中x是多维、y是一维怎么画图_基于Python的数据可视化：从一维到多维...
目录一.iris数据集介绍二.一维数据可视化三.二维数据可视化四.多维数据可视化五.参考资料一.iris数据集介绍 iris数据集有150个观测值和5个变量,分别是sepal length ...
自己搜集编写的Delphi 通用函数
{ ********************************************************************** } { Currency Common Functio ...
多元统计分析——数据降维——因子分析（FA）
一.因子分析简介 1.定义 1904年,英国心理学家CharlesSpearman研究了33名学生在古典语.法语和英语三门成绩,三门成绩的相关性系数如下: 三门成绩的高度相关会不会是由于它们三个成绩的 ...
因子分析factor analysis_spss运用_python建模(推荐AAA)
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录制视频) https://study.163.com/course/introduction.htm?courseId=1005269003& ...
主成分分析、因子分析、聚类分析的比较与应用
听说这是一篇论文不过我没详细看. 一.概述主成分分析就是将多项指标转化为少数几项综合指标,用综合指标来解释多变量的方差- 协方差结构.综合指标即为主成分.所得出的少数几个主成分,要尽可能多地保留原 ...
主成分分析和因子分析及其在R中的…
1 主成分分析和因子分析比较主成分分析和探索性因子分析是两种用来探索和简化多变量复杂关系的常用方法,它们之间有联系也有区别. 主成分分析(PCA)是一种数据降维方法,它能将大量相关变量转化为一组很 ...
因子分析——python
目录一.起源二.基本思想三.算法用途四.因子分析步骤五.factor_analyzer库四.实例详解 1.导入库 2.读取数据 3.充分性检测 3.1 Bartlett's球状检验 3.2 ...
主成分分析、因子分析和聚类分析的区别
主成分分析就是设法将原来众多具有一定相关性(比如P个指标),重新组合成一组新的互相无关的综合指标来代替原来的指标. 综合指标即为主成分.所得出的少数几个主成分,要尽可能多地保留原始变量的信息,且彼此不 ...

因子分析--FactorAnalysis类

目录

参数