我有以下代码,并且我尝试将True
分配给新列,其中实际日期等于列中的日期' D' (创建日期)和False
任何其他。
我对Python很陌生,所以我想了解我做错了什么:
def GetData():
myList = GetFileList(TodaysDate,5)
NewDataFrame = pd.DataFrame()
for x in myList:
#The date of the actuals data is the day BEFORE it was created
ActualDate = getDate(x) - timedelta(days=1)
myTempData = pd.read_csv(WSIWeatherDir + "\\" + x, parse_dates = [" date"], date_parser = DateTimeParser)
myTempData = myTempData.replace(-99,np.nan)
myTempData = myTempData.loc[myTempData['name'].isin(NL_WeatherStations)]
myTempData['D'] = myTempData[' date'].dt.date
#MyData = myTempData.sort([' date'], ascending=True)
#print MyData
#Select indices of the weather file where the column " date" is equal to the actual date we are looking for
MyActualIndex = myTempData['D'] == ActualDate
MyActualData = myTempData[MyActualIndex]
MyExpectedIndex = myTempData.index.difference(MyActualData.index)
MyExpectedData = myTempData.loc[MyExpectedIndex]
myTempData ['Actuals'] = [True] * len(MyActualData.index)
myTempData ['Actuals'] = [False] * len(MyExpectedData.index)
NewDataFrame = pd.concat([NewDataFrame,myTempData])
return NewDataFrame
print GetData()
错误
runfile('C:/Users//Desktop/NLG_TAC_Calculation/TAC_2.py', wdir='C:/Users//Desktop/NLG_TAC_Calculation')
Traceback (most recent call last):
File "<ipython-input-4-c9c151bca95a>", line 1, in <module>
runfile('C:/Users//Desktop/NLG_TAC_Calculation/TAC_2.py', wdir='C:/Users//Desktop/NLG_TAC_Calculation')
File "C:\Python27\lib\site-packages\spyderlib\widgets\externalshell\sitecustomize.py", line 685, in runfile
execfile(filename, namespace)
File "C:\Python27\lib\site-packages\spyderlib\widgets\externalshell\sitecustomize.py", line 71, in execfile
exec(compile(scripttext, filename, 'exec'), glob, loc)
File "C:/Users//Desktop/NLG_TAC_Calculation/TAC_2.py", line 117, in <module>
print GetData()
File "C:/Users//Desktop/NLG_TAC_Calculation/TAC_2.py", line 108, in GetData
myTempData ['Actuals'] = [True] * len(MyActualData.index)
File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 2127, in __setitem__
self._set_item(key, value)
File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 2204, in _set_item
value = self._sanitize_column(key, value)
File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 2362, in _sanitize_column
value = _sanitize_index(value, self.index, copy=False)
File "C:\Python27\lib\site-packages\pandas\core\series.py", line 2579, in _sanitize_index
raise ValueError('Length of values does not match length of '
ValueError: Length of values does not match length of index
答案 0 :(得分:7)
我最好的猜测依赖于这一部分:
myTempData ['Actuals'] = [True] * len(MyActualData.index)
myTempData ['Actuals'] = [False] * len(MyExpectedData.index)
首先说myTempData['Actuals']
是一个大小为len(MyActualData.index)
的列,仅包含True
个值。接下来,它会替换包含len(MyExpectedData.index)
值的另一个大小为False
的列(我希望它不同)。
您可以先创建一列True
值,然后再替换False
个:
myTempData['Actuals'] = True
myTempData.iloc[MyExpectedIndex] = False