Question

新手在这里 - 我的第一次尝试似乎没问题，但这是我第二次使用大熊猫。在Windows 7上使用Pandas 0.12.0时，我从SQL中读取了2个数据帧一个按预期使用groupby，所以我确定我的问题不是语法。但另一方面，type（reddf）返回pandas.core.frame.DataFrame，当我尝试reddf.groupby（'any column'）时，我得到 - 最后几行 -

    c:\python27\lib\site-packages\pandas\core\groupby.pyc in __init__(self, index, grouper,     name, level, sort)
   1197             # no level passed
   1198             if not isinstance(self.grouper, np.ndarray):
-> 1199                 self.grouper = self.index.map(self.grouper)
   1200                 if not (hasattr(self.grouper,"__len__") and \
   1201                    len(self.grouper) == len(self.index)):

c:\python27\lib\site-packages\pandas\algos.pyd in pandas.algos.arrmap_int64 (pandas\algos.c:62839)()

TypeError: 'DataFrame' object is not callable

我知道 groupby 是正常的，并且该列存在，因此数据框上还有一些其他约束/条件，我只是不知道或已经过去了。那么什么可能导致这个错误？我该怎么办？我将来应该寻找什么？

请求信息

print type(reddf.index)
<class 'pandas.core.index.Int64Index'>

print repr(reddf.index) 
Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], dtype=int64)

print type(reddf.index.map)
<type 'instancemethod'>

print repr(reddf.index.map)
<bound method Int64Index.map of Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], dtype=int64)>

Just in case
reddf gives
<class 'pandas.core.frame.DataFrame'>
Int64Index: 20 entries, 0 to 19
Data columns (total 24 columns):
AssetId                  20  non-null values
DateAdded                20  non-null values
ModelId                  20  non-null values
UsageTypeId              20  non-null values
DateAdded                20  non-null values
Name                     20  non-null values
NatureId                 20  non-null values
IsContainer              20  non-null values
SparePartNumber          8  non-null values
ProductNumber            19  non-null values
SupportCategoryOid       20  non-null values
SerialNumber             20  non-null values
IpAddress                20  non-null values
Description              20  non-null values
CustomsId                15  non-null values
AssetTag                 20  non-null values
ParentId                 5  non-null values
ManagementProcessorId    7  non-null values
OperatingSystem          20  non-null values
OsVersion                20  non-null values
SystemName               20  non-null values
LocationId               10  non-null values
RomVersion               20  non-null values
MacAddress               19  non-null values
dtypes: bool(1), datetime64[ns](2), float64(3), int64(5), object(13)

我特别是在做一个reddf.groupby（'ModelId'）的错误。感谢

感谢大家，重复的字段名称引起了我的问题，我不敢相信我之前没有注意到最后的评论。

现在，我不明白.index输出如何消除其他问题，你能详细说明吗？如果索引丢失怎么办，不应该groupby能够正常运行，为什么不呢？只是寻找一个简短的解释，如果你指向代码，那很好。感谢帮助，伙计们。

Answer 1

是由“DateAdded”列重复造成的。重命名它，你很高兴。

Answer 2

仅供参考，重复的列名称不应再导致此错误。如果您正在使用最新的熊猫，那么此错误是由其他原因引起的。

请参阅：https://github.com/pandas-dev/pandas/pull/8210

groupby - TypeError'DataFrame'对象不可调用

2 个答案: