Python Pandas 如何避免将 int 转换为浮点数的 map

发布于04月14日

I have a dictionary:

matches = {282: 285,
 266: 277,
 276: 293,
 263: 264,
 286: 280,
 356: 1371,
 373: 262,
 314: 327,
 294: 290,
 285: 282,
 277: 266,
 293: 276,
 264: 263,
 280: 286,
 1371: 356,
 262: 373,
 327: 314,
 290: 294}

还有一个df，就像这样:

现在，我试图创建一个"敌手id"列，从dict映射，如下所示:

df['adversary_id'] = df['team_id'].map(matches)

但是这个新的列敌手_id被转换为type float，两行以NaN结尾:

为什么，如果所有数据都是int类型？

我该怎么解决这个问题？

推荐答案

这是因为np.在数据帧中看到的nan或nan(它们不完全相同)值是float类型.遗憾的是，只要代码中有NaN值，就无法避免这一限制.

请阅读Pandas 文档here中的更多内容.

因为NaN是一个浮点数，所以一列整数(甚至缺少一个值)被转换为浮点数据类型(有关更多信息，请参阅对整数NA的支持).pandas提供了一个可为空的整数数组，可以通过显式请求数据类型来使用该数组:

建议的解决方案是强制type人:

df['team_id'] = pd.Series(df['team_id'],dtype=pd.Int64Dtype())

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5 entries, 0 to 4
Data columns (total 1 columns):
 #   Column   Non-Null Count  Dtype
---  ------   --------------  -----
 0   Example  4 non-null      Int64
dtypes: Int64(1)
memory usage: 173.0 bytes