下面的简单实现从上面继承 - 但显示过滤掉特定列中的 nan 行 - 就地 - 并用于 large 数据框按列名计数 nan 的行(之前和之后)
import pandas as pd
import numpy as np
df = pd.DataFrame([[1,np.nan,'A100'],[4,5,'A213'],[7,8,np.nan],[10,np.nan,'GA23']])
df.columns = ['areaCode','Distance','accountCode']
数据框
areaCode Distance accountCode
1 NaN A100
4 5.0 A213
7 8.0 NaN
10 NaN GA23
之前:用nan计算行数(每列):
df.isnull().sum()
按列计数:
areaCode 0
Distance 2
accountCode 1
dtype: int64
就地删除不需要的行:
df.dropna(subset=['Distance'],inplace=True)
之后:用 nan 计算行数(每列):
df.isnull().sum()
按列计数:
areaCode 0
Distance 0
accountCode 1
dtype: int64
数据框:
areaCode Distance accountCode
4 5.0 A213
7 8.0 NaN