文章詳情頁

python使用pandas抽樣訓練數據中某個類別實例

瀏覽：2日期：2022-08-05 13:44:43

廢話真的一句也不想多說，直接看代碼吧！

# -*- coding: utf-8 -*- import numpy from sklearn import metrics from sklearn.svm import LinearSVC from sklearn.naive_bayes import MultinomialNB from sklearn import linear_model from sklearn.datasets import load_iris from sklearn.cross_validation import train_test_split from sklearn.preprocessing import OneHotEncoder, StandardScaler from sklearn import cross_validation from sklearn import preprocessing import scipy as spfrom sklearn.linear_model import LogisticRegressionfrom sklearn.feature_selection import SelectKBest ,chi2import pandas as pdfrom sklearn.preprocessing import OneHotEncoder#import iris_data ’’’creativeID,userID,positionID,clickTime,conversionTime,connectionType,telecomsOperator,appPlatform,sitesetID,positionType,age,gender,education,marriageStatus,haveBaby,hometown,residence,appID,appCategory,label’’’ def test(): df = pd.read_table('/var/lib/mysql-files/data1.csv', sep=',') df1 = df[['connectionType','telecomsOperator','appPlatform','sitesetID', 'positionType','age','gender','education','marriageStatus', 'haveBaby','hometown','residence','appCategory','label']] print df1['label'].value_counts() N_data = df1[df1['label']==0] P_data = df1[df1['label']==1] N_data = N_data.sample(n=P_data.shape[0], frac=None, replace=False, weights=None, random_state=2, axis=0) #print df1.loc[:,'label']==0 print P_data.shape print N_data.shape data = pd.concat([N_data,P_data]) print data.shape data = data.sample(frac=1).reset_index(drop=True) print data[['label']] return

補充拓展：pandas實現對dataframe抽樣

隨機抽樣

import pandas as pd#對dataframe隨機抽取2000個樣本pd.sample(df, n=2000)

分層抽樣

利用sklean中的函數靈活進行抽樣

from sklearn.model_selection import train_test_split#y是在X中的某一個屬性列X_train, X_test, y_train, y_test = train_test_split(X,y, test_size=0.2, stratify=y)

以上這篇python使用pandas抽樣訓練數據中某個類別實例就是小編分享給大家的全部內容了，希望能給大家一個參考，也希望大家多多支持好吧啦網。

Python 編程

上一條：使用python的turtle函數繪制一個滑稽表情下一條：如何使用repr調試python程序

相關文章：

1. XML入門的常見問題(三)2. .NET Core 分布式任務調度ScheduleMaster詳解3. 不要在HTML中濫用div4. HTML5實戰與剖析之觸摸事件(touchstart、touchmove和touchend)5. CSS清除浮動方法匯總6. HTTP協議常用的請求頭和響應頭響應詳解說明（學習）7. XML在語音合成中的應用8. ASP將數字轉中文數字(大寫金額)的函數9. XML 非法字符（轉義字符）10. jscript與vbscript 操作XML元素屬性的代碼

排行榜

					
					JavaScript函數重載操作實例淺析
PHP擴展之壓縮與歸檔擴展1——Bzip2
Android實現動態改變shape.xml中圖形的顏色
python GUI庫圖形界面開發之PyQt5滑塊條控件QSlider詳細使用方法與實例
python使用ctypes庫調用DLL動態鏈接庫
老虎身上的斑紋－－－正確使用JAVA1.5里的Annotation
Java基于注解實現的鎖實例解析
Spring EL表示式的運用@Value說明
ASP.NET MVC實現橫向展示購物車
python正則表達式re.match()匹配多個字符方法的實現
python如何利用traceback獲取詳細的異常信息