Develop a new module that identifies and masks large variable initializations in code files.

IBM / data-prep-kit

Open source project for data preparation of LLM application builders

Apache License 2.0

Search before asking

[X] I searched the issues and found no similar issues.

Component

Transforms/Other

Feature

Create a new module that identifies and masks large variable initializations in code files, with the goal of improving the quality of the resulting code data. The new module should:

Identify Large Initializations: Detect variable initializations that exceed a predefined threshold of lines or characters.
Mask Identified Sections: Replace the detected large initializations with a placeholder.
Provide Configuration Options: Allow customization of the threshold for what constitutes a "large" initialization and of the format of the masking placeholder.
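
To make the three requirements above concrete, here is a minimal sketch of what such a module could look like. It is not an existing data-prep-kit transform. It assumes Python source files only and relies on the standard-library `ast` module (Python 3.8+). The function name `mask_large_initializations`, the parameters `max_lines`, `max_chars` and `placeholder`, and the default placeholder text are all hypothetical choices for illustration.

```python
import ast

# Hypothetical default; a string literal keeps the masked file parsable.
DEFAULT_PLACEHOLDER = '"<LARGE_INITIALIZATION_MASKED>"'

# Literal node types treated as "initializations" in this sketch (an assumption).
_LITERAL_NODES = (ast.List, ast.Tuple, ast.Set, ast.Dict, ast.Constant)


def mask_large_initializations(source, max_lines=5, max_chars=200,
                               placeholder=DEFAULT_PLACEHOLDER):
    """Replace literal values assigned to variables with `placeholder`
    when they span more than `max_lines` lines or `max_chars` characters."""
    try:
        tree = ast.parse(source)
    except (SyntaxError, ValueError):
        return source  # leave files that do not parse untouched

    # ast column offsets are UTF-8 byte offsets, so work on the encoded text.
    data = source.encode("utf-8")
    line_starts = [0]
    for line in data.splitlines(keepends=True):
        line_starts.append(line_starts[-1] + len(line))

    spans = []
    for node in ast.walk(tree):
        if not isinstance(node, (ast.Assign, ast.AnnAssign)):
            continue
        value = node.value  # None for a bare annotation such as `x: int`
        if not isinstance(value, _LITERAL_NODES):
            continue
        start = line_starts[value.lineno - 1] + value.col_offset
        end = line_starts[value.end_lineno - 1] + value.end_col_offset
        n_lines = value.end_lineno - value.lineno + 1
        n_chars = len(data[start:end].decode("utf-8"))
        if n_lines > max_lines or n_chars > max_chars:
            spans.append((start, end))

    # Replace from the end of the file so earlier offsets stay valid.
    for start, end in sorted(spans, reverse=True):
        data = data[:start] + placeholder.encode("utf-8") + data[end:]
    return data.decode("utf-8")


if __name__ == "__main__":
    sample = "small = [1, 2, 3]\nbig = [\n    1,\n    2,\n    3,\n    4,\n]\n"
    print(mask_large_initializations(sample, max_lines=3))
```

Only the assigned value is replaced, so the variable name and the surrounding code stay intact. Supporting other languages would need a language-aware parser in place of `ast`; that choice is left open here.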

Are you willing to submit a PR?

[ ] Yes I am willing to submit a PR!

Develop a new module that identifies and masks large variable initializations in code files. #385
