dwyl / alog

🌲 alog (Append-only Log) is an easy way to start using the Lambda/Kappa architecture in your Elixir/Phoenix Apps while still using PostgreSQL (with Ecto).
GNU General Public License v2.0
15 stars 2 forks source link

How to handle inserting duplicate data? #53

Open RobStallion opened 5 years ago

RobStallion commented 5 years ago

relates to #45

We have created a function in the adapter for handling inserting data into the database. This function will create a cid and entry_id(which we are considering renaming to id, see here) based on the data passed in and adds them to the params passed to insert.

This will ensure that the cid and entry_id data is inserted into the database.

As a rule we don't want to insert duplicate data as it is a waste. We will use the cid value generated to check if the value is a duplicate and if so we can "reject" the insert.

This works well with the current set up. However, imagine we have a table user which has one column called username.

If our user is originally called batman but decides to change their name to the dark knight. This will work and we will insert a new row in the database.

Now the user decided that they want to change their name back to batman. If we try to insert this, we will get an error because we will be creating a duplicate cid value (which is obviously not what we want in this case).

How do we want to handle this case?

Please let me know your thoughts and if you have any other ideas about this.

nelsonic commented 5 years ago

@RobStallion really good question(s) here thank you for opening this issue to capture the discussion.

In the first instance we need to re-direct /batman to /darkknight so that people searching for /batman will find his new alias and see his profile. But when he decides to change back, the /darkknight should re-direct to /batman ...

Similar to how Prince (RIP) rebranded as image "Symbol" (The Artist Formerly Known as Prince) ... https://en.wikipedia.org/wiki/Prince_(musician) and then a few years later changed back to "Prince". If you search for "symbol musician" Prince' wikipedia comes up.

This is all "fine" for a couple of name/alias changes, but what happens if someone spots this "loophole" decides to programmatically change their name to all top 100k usernames on Twitter and all words in the dictionary, then a person could "own" the redirect for all the usernames and words i.e. every name redirects to "bob".

If we are using this feature in our own app(s) we will need to limit it's use to avoid this "hack".