This pull request introduces a new utility script, "PDF Organizer," designed to automate the organization and management of PDF files within a specified directory. By analyzing PDF metadata, the script efficiently categorizes PDFs into subfolders, renames files for uniformity, and generates a summary report of the process.
Features
Metadata Extraction: Utilizes PyPDF2 to extract metadata (title, author) from PDFs.
Dynamic Organization: Automatically sorts PDFs into author-named subfolders.
Standardized Renaming: Renames PDF files to a consistent "Author - Title.pdf" format.
Summary Reporting: Generates a report detailing the organization outcome, including counts of PDFs per category.
Motivation
Managing a large collection of PDF files can be tedious, especially when dealing with academic papers, legal documents, or extensive reports. This script aims to simplify file management, making it easier for users to locate and organize their PDFs based on content relevance.
How to Use
Ensure PyPDF2 is installed: pip install PyPDF2
Modify the source_folder and report_path in the script to match your directory structure.
Run the script: python pdf_organizer.py
Future Enhancements
Support for additional metadata fields (e.g., subject, keywords) for finer categorization.
Interactive mode allowing users to select organization criteria directly.
GUI for ease of use by non-technical users.
I believe this script will be a valuable addition to the repository, offering a practical solution to a common problem faced by many. I look forward to feedback and any suggestions for improvement.
Summary
This pull request introduces a new utility script, "PDF Organizer," designed to automate the organization and management of PDF files within a specified directory. By analyzing PDF metadata, the script efficiently categorizes PDFs into subfolders, renames files for uniformity, and generates a summary report of the process.
Features
Motivation
Managing a large collection of PDF files can be tedious, especially when dealing with academic papers, legal documents, or extensive reports. This script aims to simplify file management, making it easier for users to locate and organize their PDFs based on content relevance.
How to Use
pip install PyPDF2
source_folder
andreport_path
in the script to match your directory structure.python pdf_organizer.py
Future Enhancements
I believe this script will be a valuable addition to the repository, offering a practical solution to a common problem faced by many. I look forward to feedback and any suggestions for improvement.