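Machine learning, data preprocessing, splitting data

Time: 2020-03-02 16:40:22

Tags: tensorflow machine-learning data-processing

I am working on a machine learning problem with eight features and four targets. I want to use the same model for all of the predictions, so I would like to build a list and loop over it so that one model can be used four times, once per target. However, I am not sure how to build that list. Can someone help me? The code is attached below.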

import matplotlib.pyplot as plt
import numpy as np
import sklearn
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
from tensorflow import keras

"""------------- Read all the data points in the file ----------------------------------------"""
zeolite_13X_error = pd.read_csv("zeolite_13X_error.csv", delimiter=",")  # Used for training set
zeolite_copy = zeolite_13X_error.copy()  # Used for test set
"""--------------------------------------------------------------------------------------------"""


"""------------- Filter points, Recovery rate and Purity - Test Set Only ---------------------"""
zeolite_13X_error_testset = zeolite_copy[zeolite_copy.Recovery > 0.7]
zeolite_13X_error_testset = zeolite_13X_error_testset[zeolite_13X_error_testset.Purity > 0.7]
zeolite_13X_error_testset = zeolite_13X_error_testset[zeolite_13X_error_testset.Recovery < 1.0]

# Shuffle all the data points in the test set
# Environment: Anaconda, Python 3.7
# TODO: look into the random package and set a seed so the shuffle is reproducible
zeolite_13X_error_testset = zeolite_13X_error_testset.reindex(
    np.random.permutation(zeolite_13X_error_testset.index))

# Extract the values as a NumPy array, one row per data point
zeolite_13X_error_testset = zeolite_13X_error_testset.to_numpy()
test_list = []
test_list.append(zeolite_13X_error_testset)

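After this step, I am not sure how to make each list entry contain all eight features but only one of the four targets. Could anyone help me with this? Thanks.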
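
One possible way to build such a list is sketched below. It rests on assumptions the question does not confirm: that the first eight columns of the array are the features and the last four are the targets. The names `split_per_target`, `N_FEATURES`, `N_TARGETS` and the synthetic `demo` array are hypothetical and used only for illustration; the real `zeolite_13X_error_testset` array would take the place of `demo`.

```python
import numpy as np

N_FEATURES = 8  # assumption: the first eight columns are the features
N_TARGETS = 4   # assumption: the last four columns are the targets


def split_per_target(data, n_features=N_FEATURES, n_targets=N_TARGETS):
    """Return a list of (X, y) pairs, one pair per target column.

    Every pair shares the same feature matrix X (all feature columns)
    and carries exactly one target column as y.
    """
    data = np.asarray(data)
    if data.ndim != 2 or data.shape[1] != n_features + n_targets:
        raise ValueError("expected a 2-D array with n_features + n_targets columns")
    X = data[:, :n_features]
    return [(X, data[:, n_features + i]) for i in range(n_targets)]


# Synthetic stand-in for the array produced by the code above
rng = np.random.default_rng(0)  # fixed seed so the shuffle is reproducible
demo = rng.random((20, N_FEATURES + N_TARGETS))
demo = demo[rng.permutation(len(demo))]  # shuffle the rows

test_list = split_per_target(demo)

for i, (X, y) in enumerate(test_list):
    # A freshly built model would be trained or evaluated here,
    # once per target, using X as input and y as the single output.
    print(f"target {i}: X shape {X.shape}, y shape {y.shape}")
```

Each entry of `test_list` can then be passed to the same model-building code inside the loop, so the model definition is written once and reused four times. If the columns are ordered differently in the CSV file, the slicing in `split_per_target` would need to be adapted.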

0 answers:

No answers yet