Closed benjaminrmoss closed 1 month ago
Hey @benjaminrmoss! I'm a bot here to help you out with bugs, questions, and more. Let's tackle this issue together!
Yes, DevLake allows for the retention of pipeline run data independently of the source system's retention policies through its synchronization policy settings. You can configure the sync frequency, time range, and the skip-on-fail option for your data, which helps in managing how and when data is collected and retained [1][2].
Additionally, DevLake supports different data collection modes, including "Collect Data (Default)," "Collect Data in Full Refresh Mode," and "Re-transform Data," which provide flexibility in how data is managed and retained [1][2].
For ensuring that historical pipeline run data is preserved, you can follow these best practices:
By following these practices, you can ensure that historical pipeline run data is consistently preserved in DevLake, even if the actual pipeline runs in Azure DevOps have been removed.
@benjaminrmoss You can run your project in incremental mode. In theory, it will only collect data that has changed since the last successful sync.
@d4x1 Thanks for the reply. Would I set this in the project configuration settings? I can't see any option for it here: Maybe this feature isn't supported for ADO yet? https://devlake.apache.org/docs/Overview/SupportedDataSources/#azure-devops
@benjaminrmoss Yes, Azuredevops is written in Python and doesn't support incremental mode. You can try azuredevops_go, which can collect data incrementally.
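For readers following the thread, here is a toy sketch of why the collection mode matters when the source keeps only the last 3 runs. It is not DevLake's actual implementation: the function names `full_refresh` and `incremental_sync`, the dict-based store, and the `updated` field are all assumptions made for illustration.

```python
from datetime import datetime, timezone


def full_refresh(store, source_runs):
    """Replace the local store with whatever the source currently returns."""
    store.clear()
    store.update({run["id"]: run for run in source_runs})


def incremental_sync(store, source_runs, last_sync):
    """Upsert only runs updated after last_sync; never delete local rows."""
    for run in source_runs:
        if last_sync is None or run["updated"] > last_sync:
            store[run["id"]] = run


def day(n):
    # Hypothetical timestamps, one run per day in January 2024.
    return datetime(2024, 1, n, tzinfo=timezone.utc)


# The source retains only the 3 most recent runs.
first_fetch = [{"id": i, "updated": day(i)} for i in (1, 2, 3)]
second_fetch = [{"id": i, "updated": day(i)} for i in (2, 3, 4)]  # run 1 expired

full = {}
full_refresh(full, first_fetch)
full_refresh(full, second_fetch)

inc = {}
incremental_sync(inc, first_fetch, None)
incremental_sync(inc, second_fetch, day(3))

print(sorted(full))  # [2, 3, 4]    -> run 1 is lost
print(sorted(inc))   # [1, 2, 3, 4] -> run 1 is retained locally
```

Under these assumptions, the full refresh mirrors the source's retention window, while the incremental upsert keeps rows that have since been deleted upstream.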
This issue has been automatically marked as stale because it has been inactive for 60 days. It will be closed in next 7 days if no further activity occurs.
This issue has been closed because it has been inactive for a long time. You can reopen it if you encounter the similar problem in the future.
Hi - The ADO project I'm working with has a pipeline retention policy of 3 previous runs. When running the data collection in DevLake, any runs that have expired or been deleted from ADO are removed, which leads to the DORA dashboard having little or no data.
Is there any way for me to configure DevLake to retain any pipeline metrics even if the actual pipeline run in ADO has been removed? Thanks for your help.