【问题标题】:How to compile and save multiple Excel files in one Excel file in AWS Glue如何在 AWS Glue 中将多个 Excel 文件编译并保存在一个 Excel 文件中
【发布时间】:2023-01-03 21:33:09
【问题描述】:

我能够生成多个 Excel 文件并在我的本地计算机上使用 pandas.ExcelWriter 对其进行编译,但如何使用 AWS Glue 获得相同的结果?

def backup_report(filename):
    with pd.ExcelWriter(local_excel_file_path + '/{0}.xlsx'.format(filename), engine='xlsxwriter') as writer:
    # -------  insert metrics that x need calculation -------
        op_AdReq_fnl.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=2, header=True)
        op_dirt_imps.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=8, header=False)
        op_prog_imps.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=12, header=False)
        op_hse_imps.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=16, header=False)
        op_FillRt_fnl.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=29, header=False)
        op_dirt_rvn.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=57, header=False)
        op_prog_rvn.to_excel(writer, sheet_name='weekly', index=True, startcol=0, startrow=61, header=False)

        workbook  = writer.book
        worksheet = writer.sheets['weekly']

     # -------  report date -------
        update_dt = date.today().strftime("%d %b %Y")
        worksheet.write(0,0,'Last Update: '+ update_dt)

        report_dt = start_dt + " - " + end_dt
        worksheet.write(0, 1, '('+ report_dt + ')')

    # -------  write index names -------
        metric_format = workbook.add_format({'bold': False, 'font_color': 'black', 'align': 'left', 'valign': 'vcenter'})
        metric_format.set_border()
        metric_format2 = workbook.add_format({'bold': True, 'font_color': 'black', 'align': 'left', 'valign': 'vcenter'})
        metric_format2.set_border()
        metric_format2.set_bg_color('silver')

        fmt_number = workbook.add_format({'num_format': '#,,##0'})
        fmt_percent = workbook.add_format({'num_format': '0%'})

    writer.save()
    writer.close()

【问题讨论】:

  • 抱歉,我正在尝试在 Glue 上运行此脚本

标签: python excel pandas amazon-web-services aws-glue


【解决方案1】:

我在 AWS Glue 中成功使用了pd.Excelwriter

def backup_report(filename):
    with io.BytesIO() as output:
        with pd.ExcelWriter(output, engine='xlsxwriter') as writer:
             .
             .
             .
    writer.save()
    writer.close()

【讨论】:

    【解决方案2】:

    早上好,但是您如何或在哪里设置 S3 存储桶的最终路径?

    【讨论】:

    猜你喜欢
    • 1970-01-01
    • 1970-01-01
    • 2023-03-09
    • 2018-11-12
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2019-11-04
    相关资源
    最近更新 更多