The drop_duplicates() function handles a common data-cleaning task: removing duplicate rows from a DataFrame.
DataFrame.drop_duplicates(subset=None, keep='first', inplace=False)
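Depending on the arguments passed, it returns a DataFrame with the duplicate rows removed. By default, rows are compared across all columns and the first occurrence of each duplicated row is kept.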
import pandas as pd

emp = {"Name": ["Parker", "Learnfk", "William", "Parker"],
       "Age": [21, 32, 29, 21]}
info = pd.DataFrame(emp)
print(info)
Output
      Name  Age
0   Parker   21
1  Learnfk   32
2  William   29
3   Parker   21
import pandas as pd

emp = {"Name": ["Parker", "Learnfk", "William", "Parker"],
       "Age": [21, 32, 29, 21]}
info = pd.DataFrame(emp)
# Row 3 repeats row 0 in every column, so it is dropped.
info = info.drop_duplicates()
print(info)
Output
      Name  Age
0   Parker   21
1  Learnfk   32
2  William   29
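The subset and keep parameters from the signature are not exercised by the example above. The following is a minimal sketch of how they might be used. The sample data is hypothetical (the second "Parker" is given a different age so that the rows differ), and the variable names by_name_last and no_repeats are chosen only for illustration.

```python
import pandas as pd

# Hypothetical data: the two "Parker" rows differ in Age, so they are
# duplicates only when the comparison is restricted to the "Name" column.
emp = {
    "Name": ["Parker", "Learnfk", "William", "Parker"],
    "Age": [21, 32, 29, 40],
}
info = pd.DataFrame(emp)

# subset limits the comparison to the listed columns;
# keep="last" retains the last occurrence of each duplicated name.
by_name_last = info.drop_duplicates(subset=["Name"], keep="last")

# keep=False drops every row whose name appears more than once.
no_repeats = info.drop_duplicates(subset=["Name"], keep=False)

print(by_name_last)
print(no_repeats)
```

Both calls return new DataFrames that keep the original index labels; info itself is left unchanged because inplace defaults to False.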