Accelerating K-means Clustering Through CUDA Parallel Computing

This repository provides a high-performance implementation of the K-Means clustering algorithm, optimized for execution on NVIDIA GPUs using CUDA. The project focuses on leveraging shared memory and parallel reduction techniques to achieve significant performance improvements over traditional CPU-based approaches.

Prerequisites

The project requires the following components:

NVIDIA GPU with Compute Capability 6.0 or higher
CUDA Toolkit (minimum version 11.0)
C++ compiler supporting CUDA (such as nvcc from the CUDA Toolkit)

Setup and Usage

Clone the repository:

git clone lorenzo-27/kmeans-cuda
cd kmeans-cuda

Configure the algorithm parameters:
- Open kmeans_config.py
- Adjust the clustering parameters according to your requirements
Compile the project:
- If using CLion with CUDA support, the build process is automatically handled.
- For manual compilation, ensure you create a cmake-build-release directory or update the executable path in kmeans.py
Run the program:

Use the Python script kmeans.py to execute the compiled binary and manage datasets and results.

Note

Upon execution, the program automatically creates two directories:

data/: Contains generated datasets
results/: Stores performance plots and analysis tables

Documentation

For a comprehensive understanding of the implementation and performance analysis, please refer to our detailed technical report available here. The report includes:

Implementation details
Performance benchmarks
Experimental results and analysis

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
TR2-kmeans.pdf		TR2-kmeans.pdf
kmeans.py		kmeans.py
kmeans_config.py		kmeans_config.py
main.cu		main.cu
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Accelerating K-means Clustering Through CUDA Parallel Computing

Prerequisites

Setup and Usage

Documentation

License

About

Releases

Packages

Languages

License

lorenzo-27/kmeans-cuda

Folders and files

Latest commit

History

Repository files navigation

Accelerating K-means Clustering Through CUDA Parallel Computing

Prerequisites

Setup and Usage

Documentation

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages