DDory's Study Blog
    Sol-Hee

    Data Engineer


    Parquet

    September 6, 2021

    Index

    • 1. DataFrame -> Parquet
    • 2. Parquet -> S3 upload

    1. DataFrame -> Parquet

    # DF is a pandas DataFrame; to_parquet needs pyarrow or fastparquet installed.
    DF.to_parquet('{path}/{file_name}.parquet')
    

    2. Parquet -> S3 upload

    import awswrangler as wr

    # dataset=True treats s3_path as a dataset prefix (folder) rather than a single file name.
    wr.s3.to_parquet(df=DF, path='{s3_path}', dataset=True)
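
The upload in step 2 can be wrapped in a small helper, sketched below under a few assumptions. `upload_parquet_dataset` and its `writer` argument are hypothetical names, not part of awswrangler; `writer` exists only so the helper can be tried without AWS credentials. The `partition_cols` and `mode` keywords are options of `wr.s3.to_parquet` that apply when `dataset=True`.

```python
from typing import Any, Callable, Optional, Sequence


def upload_parquet_dataset(
    df: Any,
    s3_path: str,
    partition_cols: Optional[Sequence[str]] = None,
    mode: str = "append",
    writer: Optional[Callable[..., Any]] = None,
) -> Any:
    """Write df under s3_path as a Parquet dataset (hypothetical helper)."""
    if not s3_path.startswith("s3://"):
        raise ValueError("s3_path must look like 's3://bucket/prefix/'")
    if writer is None:
        # Imported lazily so the helper can be exercised without awswrangler.
        import awswrangler as wr

        writer = wr.s3.to_parquet
    # With dataset=True the path is a prefix, so make sure it ends with "/".
    return writer(
        df=df,
        path=s3_path.rstrip("/") + "/",
        dataset=True,
        mode=mode,  # "append", "overwrite", or "overwrite_partitions"
        partition_cols=list(partition_cols) if partition_cols else None,
    )


if __name__ == "__main__":
    # Stand-in writer that only prints the arguments it would send to S3.
    def print_writer(**kwargs: Any) -> None:
        print(kwargs)

    upload_parquet_dataset("DF placeholder", "s3://my-bucket/my-prefix",
                           partition_cols=["year"], writer=print_writer)
```

Leaving `writer` unset makes the helper call `wr.s3.to_parquet` directly, which needs valid AWS credentials and write access to the bucket.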
    

    Tags: Parquet, Python, Spark

    Categories: Spark

    Updated: September 6, 2021

    © 2022 Sol-Hee. Powered by Jekyll & Minimal Mistakes.