AI Code Summary is an open-source tool that automates the process of aggregating code files from a directory into a single markdown file. It intelligently skips files specified in the .gitignore
and leverages OpenAI's GPT models to generate concise summaries for each code file. The final markdown file is both comprehensive and easy to understand, making it suitable for use in OpenAI Assistant or any Retrieval-Augmented Generation (RAG) model.
By automating code summarization and aggregation, AI Code Summary streamlines documentation efforts and enhances codebase comprehension. It saves developers time by reducing the manual effort required to create summaries, helps in onboarding new team members, and facilitates code reviews by providing clear overviews of the code structure and functionality.
- Open Source: Freely available and community-driven.
- Automated Code Summarization: Utilizes OpenAI's GPT models to generate concise summaries of code files.
- Markdown Aggregation: Combines code and summaries into a single, well-structured markdown file.
- Gitignore Aware: Skips files and directories specified in
.gitignore
. - Customizable Prompts: Allows customization of the summary prompt used by the AI model.
- Easy Integration: Installable via PyPI and integrable into existing workflows.
- Testing Suite: Includes unit and end-to-end tests to ensure reliability.
- Programming Language: Python 3.11+
- Frameworks and Libraries:
- OpenAI API: For generating code summaries.
- Pathspec: For parsing
.gitignore
patterns. - Loguru: For logging.
- Python-dotenv: For environment variable management.
- Hatch: For environment management and packaging.
- Pytest: For testing.
- Ruff: For linting and code style enforcement.
AI Code Summary is published on PyPI and can be easily installed using pip
:
pip install ai-code-summary
For more details, visit the PyPI project page.
-
Clone the repository:
git clone https://github.com/DEV3L/ai-code-summary.git cd ai-code-summary
-
Set up environment variables:
Copy the
env.default
file to.env
and replace placeholders with your actual OpenAI API key:cp env.default .env
Edit
.env
to add yourOPENAI_API_KEY
:OPENAI_API_KEY=your_openai_api_key
-
Set up a virtual environment:
Install Hatch (if not already installed):
pip install hatch
Create and activate the virtual environment:
hatch env create hatch shell
-
Install dependencies:
pip install -r requirements.txt
Note: Dependencies are managed via
hatch
and specified inpyproject.toml
.
To generate a markdown summary of your code files:
-
Ensure your OpenAI API key is set in the
.env
file. -
Run the following script:
from ai_code_summary.markdown.export import create_markdown_from_code if __name__ == "__main__": create_markdown_from_code(".")
Alternatively, you can run the end-to-end script provided:
python run_end_to_end.py
This will generate a markdown file summarizing the code in the current directory.
An example output file is available at ai-code-summary.md.
-
Run End-to-End Test:
hatch run e2e
-
Run Unit Tests:
hatch run test
-
Publish Package to PyPI:
hatch run publish
Note: These scripts are defined in pyproject.toml
under [tool.hatch.envs.default.scripts]
.
Run the end-to-end test to ensure the tool works as expected:
hatch run e2e
To run unit tests:
hatch run test
Coverage reports are generated using pytest-cov
.
To monitor code coverage in VSCode:
-
Install the Coverage Gutters extension.
-
Run:
Command + Shift + P => Coverage Gutters: Watch
ai-code-summary/
├── ai_code_summary/
│ ├── ai/
│ │ └── summary.py
│ ├── code/
│ │ └── gitignore_pathspec.py
│ ├── files/
│ │ └── file_manager.py
│ ├── markdown/
│ │ └── export.py
│ └── env_variables.py
├── tests/
│ ├── ai/
│ │ └── summary_test.py
│ ├── code/
│ │ └── gitignore_pathspec_test.py
│ ├── files/
│ │ └── file_manager_test.py
│ └── markdown/
│ └── export_test.py
├── .env.default
├── pyproject.toml
├── README.md
├── run_end_to_end.py
├── LICENSE
- ai_code_summary/: Main package containing the code.
- ai/: Functions related to AI summarization.
- code/: Handles
.gitignore
parsing. - files/: Manages file operations.
- markdown/: Generates markdown files.
- env_variables.py: Manages environment variables.
- tests/: Contains unit tests for the code.
- .env.default: Template for environment variables.
- pyproject.toml: Project configuration and dependencies.
- run_end_to_end.py: Script to execute the end-to-end process.
- LICENSE: Project license information.
We welcome contributions! Please follow these steps:
-
Fork the repository on GitHub.
-
Create a new branch for your feature or bugfix:
git checkout -b feature/your-feature-name
-
Make your changes and commit them with clear messages.
-
Run tests to ensure nothing is broken:
hatch run test
-
Push to your fork and submit a pull request to the
main
branch.
Note: Please provide a CONTRIBUTING.md
file with detailed contributing guidelines if available.
This project is licensed under the MIT License. See the LICENSE file for details.
- Justin Beall - Creator and maintainer of the project.
- OpenAI - For providing the GPT models used in code summarization.
- Community Contributors - Thank you to all who have contributed through issues and pull requests.
- PyPI Project Page: ai-code-summary