raystack / guardian

Guardian is universal data access management tool with automated access workflows and security controls across data stores, analytical systems, and cloud products.
https://guardian.vercel.app/
Apache License 2.0
137 stars 21 forks source link

Data access monitoring #171

Open ravisuhag opened 2 years ago

ravisuhag commented 2 years ago

Guardian can manage access across multiple providers. But it is still hard for data governance managers to monitor the different aspects of data access.

Goal: With data access monitoring in Guardian, we aim to provide answers to the following questions.

Scope: Access monitoring can be tracked across different sections

ravisuhag commented 2 years ago

One way to identify the sensitivity of data could be with labels on datasets.

Few more questions, that insights can provide answers to:

  1. How many times a resource is accessed
  2. How many times sensitive BigQuery tables are getting accessed
  3. Identify what operations are done on a resource
  4. Identify users with excess access rights
  5. Alert for bulk download or high-risk flagging for an event
  6. Breach Prediction index - a type of risk modeling as per best practices
mabdh commented 2 years ago

Usage history (access logs) data collection will be discussed in detail in #265