MapReduce Job for converting SEGY to Parquet format
Segy2ParquetConverterMR is a MapReduce job designed to efficiently convert SEGY files to Parquet format. This conversion enables easier data handling, storage, and analysis in big data environments.
- High Performance: Utilizes the MapReduce framework for scalable and efficient processing.
- Compatibility: Supports SEGY-format (rev v1) files.
- Ease of Use: Simplifies the process of converting SEGY files to a more versatile Parquet format.
- Java 8 or higher
- Apache Hadoop
- Apache Maven
-
Clone the repository:
git clone https://github.com/lliryc/Segy2ParquetConverterMR.git
-
Navigate to the project directory:
cd Segy2ParquetConverterMR
-
Build the project using Maven:
mvn clean install
To run the conversion job, use the following command:
hadoop jar target/Segy2ParquetConverterMR-1.0.jar <input_segy_path> <output_parquet_path>
This project is licensed under the Apache-2.0 License.
For any issues or questions, please create an issue in this repository.