stackabletech / issues

This repository is only for issues that concern multiple repositories or don't fit into any specific repository
2 stars 0 forks source link

Define a scenario with personas across products #524

Closed fhennig closed 5 months ago

fhennig commented 7 months ago

We need a realistic foundation to write rules for. This means multiple users, data and groups.

What the scenario should cover

Acceptance criteria

NickLarsenNZ commented 7 months ago

[!IMPORTANT] Scenario has moved from this issue into the opa-operator repo: stackabletech/opa-operator#522

Here's a starter for ten, feel free to amend if it doesn't sit right with you...

Knab

Knab is a bank with a number of data science teams covering various aspects of the banks operation and compliance obligations.

Usernames are $firstname.$lastname.

Teams

Two of the data science teams are outlined below:

Compliance and Regulation Analytics

This team sits under the wider Compliance and Regulation team,and is tasked with making use of the banks customer, credit, securities data and produce internal and regulatory reports to aid in regulatory compliance.

Members of the team:

Customer Analytics

This team falls under the Customer Service division and is tasked with making use of the banks data for things like:

They produce monthly and quarterly reports, but also build dashboards to show live data (eg: call queue, and wait time statistics).

Members of the team:

fhennig commented 7 months ago

I'd also add a manager that wants access to superset dashboards and migh be part of a manager group. And another idea is an external contractor that only has access to trino as a one-off or something.

For multi-tenancy maybe for trino/HDFS there could be dev and prod deployments?

NickLarsenNZ commented 7 months ago

I'd also add a manager that wants access to superset dashboards and migh be part of a manager group.

IMO "managers having extra access" is an old tale used in examples. Managers typically wouldn't have extra access, and in fact would probably have no access.

Team Lead(s) could be in the managers/admins group, if that is satisfactory?

And another idea is an external contractor that only has access to trino as a one-off or something.

Cool, will add an extra persona for that.

For multi-tenancy maybe for trino/HDFS there could be dev and prod deployments?

Do you mean Trino with HDFS, or one-or-the-other? I'm not so familiar, so maybe my question is stupid. I do see Trino + Hive as a thing though.

fhennig commented 6 months ago

IMO "managers having extra access" is an old tale used in examples. Managers typically wouldn't have extra access, and in fact would probably have no access.

That's fine for me!

Do you mean Trino with HDFS, or one-or-the-other? I'm not so familiar, so maybe my question is stupid. I do see Trino + Hive as a thing though.

I mean that there are two instances of something (maybe Trino, maybe HDFS or maybe something else). Maybe this makes more sense for something like Spark? A dev cluster and a prod cluster. I'm also not super familiar with where something like this is done realisitcally. But maybe Knab has a transaction processing pipeline in Spark and when they make a change to it they first deploy it in a dev cluster before rolling it out to prod. So there would be some shared permissions for dev and prod, but also prod would be more locked down probably.

NickLarsenNZ commented 6 months ago

Scenario has moved from this issue into the opa-operator repo: stackabletech/opa-operator#522

NickLarsenNZ commented 6 months ago

[!NOTE] I moved this comment out of stackabletech/opa-operator#522 so it could be closed without waiting on the following:

I spoke with @fhennig yesterday and he suggested not delving into groups yet (at least not for this ticket), but I still need to come up with various tasks that the personas would perform.

So freestyling here:

Compliance and Regulation Analytics

todo

Customer Analytics

Marketing

todo

NickLarsenNZ commented 6 months ago

@fhennig what shall we do here (considering the discussion about changing the scenario to align to Trino TPCDS data)?

fhennig commented 5 months ago

Alright, we have the files here: https://github.com/stackabletech/opa-operator/tree/main/hack

and the additional text in this ticket. I think for me that's enough for this ticket, I'm closing it.