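【Question Title】: Extracting data out of Pandas df into a list
【Posted】: 2020-04-04 01:35:13
【Question】:

I have a Pandas DataFrame whose headers and rows contain redundant data, and I want to extract values from it. For example, I have a df that looks like this: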

import numpy as np
import pandas as pd

# pandas has no pd.NaN attribute; use np.nan for missing values
df = pd.DataFrame({'Your availability: Wednesday, December 25th, 2019 5:00AM-6:00AM': ['Wednesday, December 25th, 2019 5:00AM-6:00AM', np.nan, np.nan, 'Wednesday, December 25th, 2019 5:00AM-6:00AM'],
                'Your availability: Tuesday, December 10th 2019 8:00AM-5:00PM': [np.nan, 'Tuesday, December 10th 2019 8:00AM-5:00PM', np.nan, np.nan]})

...and I want to extract the dates and put them into a dictionary for reference, one entry per row:

datetimes = {'P1': "Wednesday, December 25th, 2019 5:00AM-6:00AM", 'P2': "Tuesday, December 10th 2019 8:00AM-5:00PM", 'P3': np.nan, 'P4': "Wednesday, December 25th, 2019 5:00AM-6:00AM"}

【Question Comments】:

    Tags: python-3.x pandas list datetime text-extraction


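    【Solution 1】:

    IIUC, try this (`axis` is passed by keyword because recent pandas versions no longer accept it positionally):

    df.ffill(axis=1).iloc[:,-1].rename(lambda x: f'P{x+1}').to_dict()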
    
    Out[1159]:
    {'P1': 'Wednesday, December 25th, 2019 5:00AM-6:00AM',
     'P2': 'Tuesday, December 10th 2019 8:00AM-5:00PM',
     'P3': nan,
     'P4': 'Wednesday, December 25th, 2019 5:00AM-6:00AM'}
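
The one-liner above chains four steps, and it may be easier to follow when split apart. The following is only a sketch of that breakdown, rebuilt on the question's sample data; the variable names (`wed`, `tue`, `filled`, `last_col`, `labelled`) are illustrative and do not come from the original answer.

```python
import numpy as np
import pandas as pd

wed = 'Wednesday, December 25th, 2019 5:00AM-6:00AM'
tue = 'Tuesday, December 10th 2019 8:00AM-5:00PM'

# Sample frame from the question, with np.nan marking empty cells
df = pd.DataFrame({
    f'Your availability: {wed}': [wed, np.nan, np.nan, wed],
    f'Your availability: {tue}': [np.nan, tue, np.nan, np.nan],
})

# Step 1: forward-fill along each row (left to right), so the last
# column ends up holding the rightmost non-null value of that row
filled = df.ffill(axis=1)

# Step 2: keep only that last column as a Series
last_col = filled.iloc[:, -1]

# Step 3: relabel the default 0-based integer index as P1, P2, ...
labelled = last_col.rename(lambda i: f'P{i + 1}')

# Step 4: convert to a plain dict; rows with no value at all stay NaN
datetimes = labelled.to_dict()
print(datetimes)
```

Two assumptions are worth keeping in mind: the frame must have the default integer index for `i + 1` to work, and if a row holds different non-null values in several columns, only the rightmost one survives.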
    

    【Comments】:

      【Solution 2】:

      Is this what you want?

      df.drop_duplicates().stack().to_list()
      

      Output:

      ['Wednesday, December 25th, 2019 5:00AM-6:00AM',
       'Tuesday, December 10th 2019 8:00AM-5:00PM']
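
Note that this answer removes duplicate rows and relies on `stack()` discarding missing cells, which may depend on the pandas version in use. As a hedged alternative, the sketch below deduplicates at the level of individual values and drops NaN explicitly; it is not the answerer's method, and the names `wed`, `tue`, `values`, and `unique_slots` are illustrative.

```python
import numpy as np
import pandas as pd

wed = 'Wednesday, December 25th, 2019 5:00AM-6:00AM'
tue = 'Tuesday, December 10th 2019 8:00AM-5:00PM'

# Sample frame from the question, with np.nan marking empty cells
df = pd.DataFrame({
    f'Your availability: {wed}': [wed, np.nan, np.nan, wed],
    f'Your availability: {tue}': [np.nan, tue, np.nan, np.nan],
})

# Flatten the cells row by row into a single 1-D array
values = df.to_numpy().ravel()

# Drop missing cells explicitly, then keep the first occurrence of each value
unique_slots = pd.Series(values).dropna().drop_duplicates().tolist()
print(unique_slots)
```

Unlike Solution 1, this yields a flat list of distinct time slots and loses the per-row `P1`...`P4` labels, so it fits only when the row association is not needed.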
      

      【Comments】:
