Niketkumardheeryan / ML-CaPsule

ML-capsule is a Project for beginners and experienced data science Enthusiasts who don't have a mentor or guidance and wish to learn Machine learning. Using our repo they can learn ML, DL, and many related technologies with different real-world projects and become Interview ready.
MIT License
333 stars 294 forks source link

Document scanner #799

Open arunitasahu opened 3 weeks ago

arunitasahu commented 3 weeks ago

closes : #766 The document scanning process involves several key steps:

  1. Image Loading: Load the input image containing the documents to be scanned.
  2. Preprocessing: Preprocess the image by converting it to grayscale, applying Gaussian blur to reduce noise, and performing edge detection to detect document contours.
  3. Contour Detection: Identify contours in the edge-detected image to locate the boundaries of the document(s).
  4. Perspective Transformation: Apply perspective transformation to obtain a top-down view of the document(s) by correcting for perspective distortion.
  5. Displaying the Result: Display the original image and the scanned documents side by side for visual comparison.
arunitasahu commented 3 weeks ago

@invigorzz313 @Niketkumardheeryan Please review my pr and in case of no problems merge it with suitable label please.

invigorzz313 commented 2 weeks ago

@arunitasahu The description you wrote in the ReadMe.md file, its better if you include it in the .ipynb itself. Then the notebook would be interactive with proper code and text cells. Because we expect the code to be properly commented with explanations. As for the ReadMe.md, you can describe the techstack used, libraries used and aim & objective. Also include sample examples in it (original and final image pairs).

arunitasahu commented 2 weeks ago

@invigorzz313 I have made the required changes .Please have a look into it.

Niketkumardheeryan commented 1 week ago

@arunitasahu No cell Executed in your .ipynb file, please make it better readable with more commenting , also show used images in code ,

arunitasahu commented 1 week ago

To illustrate how the code works and what the output looks like, I have added sample images. These images show both the functioning of the code and the appearance of the output.

Screenshot 2024-06-21 020435 Screenshot 2024-06-21 020510

@Niketkumardheeryan @invigorzz313

invigorzz313 commented 5 days ago

@arunitasahu It is suggested that you display the input and output images in the code too. For this you can use matplotlib library to display the figures. That way, the images are always visible instead of using cv2.waitkey for temporary viewing. And put in more comments example the "preprocessing and finding document contours" part, you can explain the major important commands being used.