我有这种格式的数据:
term_x similar_sounding_term
insurance insurance_policy
insurance insurance_quote
insurance car_insurance
car rental rental cars in Nantucket
car rental damaged car rental policy
我想将这个数据帧转换为一个字典,其中键是term_x但是该值是该术语中所有值的列表,所以:
insurance : [insurance_policy, insurance_quote, car_insurance, rental cars in Nantucket....]
等等:
我虽然答案here解决了我的问题,但它并没有像我希望的那样将列表合并到一个键下。
答案 0 :(得分:3)
IIUC
df.groupby('term_x')['similar_sounding_term'].apply(list).to_dict()
Out[1690]:
{'carrental': ['rentalcarsinNantucket', 'damagedcarrentalpolicy'],
'insurance': ['insurance_policy', 'insurance_quote', 'car_insurance']}