Segy2ParquetConverterMR

MapReduce Job for converting SEGY to Parquet format

Overview

Segy2ParquetConverterMR is a MapReduce job designed to efficiently convert SEGY files to Parquet format. This conversion enables easier data handling, storage, and analysis in big data environments.

Features

  • High Performance: Utilizes the MapReduce framework for scalable and efficient processing.
  • Compatibility: Supports SEG-Y revision 1 (rev 1) files.
  • Ease of Use: Simplifies converting SEGY files to the more versatile Parquet format.
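For readers unfamiliar with the input format, the sketch below shows how the fixed byte layout of a SEG-Y rev 1 file makes each trace independently addressable, which is what makes the format a reasonable fit for split-based processing. It is an illustration only: `SegyLayout` and its methods are hypothetical names, not classes from this repository, and the code assumes fixed-length traces and no extended textual headers. It may not reflect how this project actually parses its input.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

/** Hypothetical helper for illustration; not part of this repository. */
final class SegyLayout {
    // SEG-Y rev 1 layout: 3200-byte textual header, 400-byte binary header,
    // then traces, each starting with a 240-byte trace header.
    static final int TEXT_HEADER_BYTES = 3200;
    static final int BINARY_HEADER_BYTES = 400;
    static final int TRACE_HEADER_BYTES = 240;

    private SegyLayout() { }

    /** Reads an unsigned 16-bit big-endian field at an absolute file offset. */
    static int u16(byte[] fileStart, int offset) {
        return ByteBuffer.wrap(fileStart).order(ByteOrder.BIG_ENDIAN).getShort(offset) & 0xFFFF;
    }

    /** Samples per data trace: file bytes 3221-3222 (zero-based offset 3220). */
    static int samplesPerTrace(byte[] fileStart) {
        return u16(fileStart, 3220);
    }

    /** Data sample format code: file bytes 3225-3226 (zero-based offset 3224). */
    static int formatCode(byte[] fileStart) {
        return u16(fileStart, 3224);
    }

    /** Sample width in bytes for the common rev 1 format codes. */
    static int bytesPerSample(int formatCode) {
        switch (formatCode) {
            case 1: // 4-byte IBM floating point
            case 2: // 4-byte two's complement integer
            case 5: // 4-byte IEEE floating point
                return 4;
            case 3: // 2-byte two's complement integer
                return 2;
            case 8: // 1-byte two's complement integer
                return 1;
            default:
                throw new IllegalArgumentException("Unsupported format code: " + formatCode);
        }
    }

    /**
     * Byte offset of the trace with the given zero-based index.
     * Assumption: fixed-length traces and no extended textual headers.
     */
    static long traceOffset(byte[] fileStart, long traceIndex) {
        long traceBytes = TRACE_HEADER_BYTES
                + (long) samplesPerTrace(fileStart) * bytesPerSample(formatCode(fileStart));
        return TEXT_HEADER_BYTES + BINARY_HEADER_BYTES + traceIndex * traceBytes;
    }
}
```

As a worked case, a file with 1500 samples per trace in format code 5 has traces of 240 + 1500 × 4 = 6240 bytes, so the trace at index 2 starts at byte 3600 + 2 × 6240 = 16080.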

Requirements

  • Java 8 or higher
  • Apache Hadoop
  • Apache Maven

Installation

  1. Clone the repository:

    git clone https://github.com/lliryc/Segy2ParquetConverterMR.git

  2. Navigate to the project directory:

    cd Segy2ParquetConverterMR

  3. Build the project using Maven:

    mvn clean install

Usage

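To run the conversion job, use the following command, replacing <input_segy_path> with the location of the SEGY input and <output_parquet_path> with the destination for the Parquet output: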

hadoop jar target/Segy2ParquetConverterMR-1.0.jar <input_segy_path> <output_parquet_path>

License

This project is licensed under the Apache License 2.0.

Contact

To report a problem or ask a question, please open an issue in this repository.
