机器学习中的sklearn中的聚类数据生成器 - 代码天地

机器学习中的sklearn中的聚类数据生成器

其他 2018-12-09 20:46:34 阅读次数: 0

版权声明：未经同意窃取和转载我的内容，如果涉及到权益问题，后果自负！ https://blog.csdn.net/weixin_41605937/article/details/84308484

参数的意思：

n_samples: int, optional (default=100)待生成的样本的总数。
n_features: int, optional (default=2)每个样本的特征数。
centers: int or array of shape [n_centers, n_features], optional (default=3)要生成的样本中心（类别）数，或者是确定的中心点。
cluster_std: float or sequence of floats, optional (default=1.0)每个类别的方差，例如我们希望生成2类数据，其中一类比另一类具有更大的方差，可以将cluster_std设置为[1.0,3.0]。
center_box: pair of floats (min, max), optional (default=(-10.0, 10.0))
shuffle: boolean, optional (default=True)
random_state:

return：

X : array of shape [n_samples, n_features]
The generated samples.
生成的样本数据集。
y : array of shape [n_samples]
The integer labels for cluster membership of each sample.

1）make_classification

sklearn.datasets.make_classification(n_samples=100, n_features=20, n_informative=2, n_redundant=2,
                   n_repeated=0, n_classes=2, n_clusters_per_class=2, weights=None,
                    flip_y=0.01, class_sep=1.0, hypercube=True,shift=0.0, scale=1.0,
                   shuffle=True, random_state=None)

通常用于分类算法。
n_features :特征个数= n_informative（） + n_redundant + n_repeated
n_informative：多信息特征的个数
n_redundant：冗余信息，informative特征的随机线性组合
n_repeated ：重复信息，随机提取n_informative和n_redundant 特征
n_classes：分类类别
n_clusters_per_class ：某一个类别是由几个cluster构成的

样本数据集的标签。
2）make_circles and make_moons

sklearn.datasets.make_circles(n_samples=100, shuffle=True, noise=None, random_state=None, factor=0.8)

3）make_gaussian_quantiles 和make_hastie_10_2

sklearn.datasets.make_gaussian_quantiles(mean=None, cov=1.0, n_samples=100, n_features=2, n_classes=3,
shuffle=True, random_state=None)

猜你喜欢

转载自blog.csdn.net/weixin_41605937/article/details/84308484

机器学习中的sklearn中的聚类数据生成器

make_blobs聚类数据生成器【转】

机器学习中的聚类

机器学习：sklearn样本生成器，make_blob(), make_classification()

生成器中取值

Python中的生成器

python 中的生成器

聚类算法数据生成器make_blobs

【scikit-learn】06：make_blobs聚类数据生成器

Python中SKlearn中kmeans聚类

语义分割中的数据生成器dataloader(pytorch版)

Python机器学习--算法实现--常用算法在Sklearn中的聚类算法和分类算法关键参数详解

Python中的生成式与生成器

【人工智能】机器学习之聚类算法Kmeans及其应用，调用sklearn中聚类算法以及手动实现Kmeans算法。

sklearn 中的MiniBatchKMeans(聚类)使用

sklearn 中的聚类方法的使用

14、Python中的生成器

生成器中yield 与 return

python中tokens生成器

Python中的迭代器、生成器

Python中的生成器（generator）

python中,迭代器与生成器

Python中的生成器与迭代器

python中的yield生成器详解

Python中的迭代器与生成器

标准库中的生成器函数

python中的生成器(一)

shell中的awk报告生成器

python中的生成器(二)

dart中的生成器函数

今日推荐

技术解析 GPT-4o：即时语音交互的突破与 GenAI 发展策略

开源大模型与闭源大模型

微信小程序授权登录获取用户的openid

亿级流量系统架构设计与实战

人工智能时代的程序设计教学与课程设计

纽交所技术问题致伯克希尔 (BRK.A) 显示跌近 100%

探索 api.maynor1024.live：一站式 AI 服务平台

AI一键去衣技术：窥见深度学习在图像处理领域的革命(最后有彩蛋)

艾体宝案例 | 使用Redis和Spring Ai构建rag应用程序

Apple M1 vs 高通8Gen2 vs Apple A12Z各方面比较

【升职加薪必备架构图】Springboot学习路线汇总_springboot四层架构流程图

与Apollo共创生态：Apollo7周年大会自动驾驶生态利剑出鞘

周排行

tensorflow 笔记：二（北大）

fork函数详解

unity单利模板

mac下的特殊键位指引（转自apple）

c语言入门-注释

Python--多任务[线程，进程，协程]

深度对抗学习在图像分割和超分辨率中的应用

【转】【Maven】Project configuration is not up-to-date with pom.xml错误解决方法

基本数据类型与常量池

部署自己的Intell项目的经历

每日归档

更多

2024-06-07(0)

2024-06-06(0)

2024-06-05(0)

2024-06-04(10)

2024-06-03(52)

2024-06-02(4)

2024-06-01(60)

2024-05-31(47)

2024-05-30(4)

2024-05-29(65)