awesomedata / awesome-public-datasets

A topic-centric list of HQ open datasets.
https://awesomedataworld.slack.com
MIT License
59.18k stars 9.76k forks source link

Bank customer transactions #234

Open Aleyasen opened 7 years ago

Aleyasen commented 7 years ago

I am wondering is there any public bank customers (anonymized) transaction data?

aqibsaeed commented 7 years ago

I am also looking for one, related to fake complaints about ATM transactions.

Aleyasen commented 7 years ago

There is a dataset, that has real transactions but it doesn't have any label for fraud detection. It is Berka dataset available as part of PKDD'99 Discovery Challenge. It is real anonymized data from Czech bank. http://sorry.vse.cz/~berka/challenge/pkdd1999/data_berka.zip Data description: http://lisp.vse.cz/pkdd99/berka.htm

UPDATE: It seems the link for data description doesn't work anymore, so please use this link from WebArchive: https://web.archive.org/web/20161019192412/http://lisp.vse.cz/pkdd99/berka.htm

Also, I found these Python Notebooks that did some analysis on the data that might be helpful: https://github.com/justinng1/berka

aqibsaeed commented 7 years ago

Awesome man! Thanks alot

mcapelle commented 6 years ago

It seems, a password is needed to decryt the data. Does one of you has (still) the decrypted data or the password?

Aleyasen commented 6 years ago

@elieArron1 That's great. Can you share the dataset?

aqibsaeed commented 6 years ago

@ElieArron1 Would you also highlight, what kind of modelling purpose it is useful for? Are there any annotations for fake transaction etc? What does MCC code represents?

harrycheese commented 6 years ago

@aqibsaeed

Mcc means merchant category code. Each code represents a category viz. Automobile, restaurant, convinience store etc. Depending on bank to bank / transaction processor the category varies

RJDavie commented 6 years ago

Thanks @harrycheese I wish everyone would use SIC codes though, they are standard pretty much across the world. #datawoes!

harrycheese commented 6 years ago

Credit card and debit card companies have been using it since ages. It's built in their legacy systems. Changes to it means cost :)

Anywhoo.. @ElieArron1 can u share this data, will be useful to mine it. Also which country's bank is it and is it a huge bank or mid size bank or small bank.

On 06-Feb-2018 5:40 PM, "Rob Davie" notifications@github.com wrote:

Thanks @harrycheese https://github.com/harrycheese I wish everyone would use SIC codes though, they are standard pretty much across the world.

datawoes!

— You are receiving this because you were mentioned.

Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-363403592, or mute the thread https://github.com/notifications/unsubscribe-auth/AiVOTwEr3xdwojfegF-VoLpsJBm1ou6mks5tSEEmgaJpZM4JGVnn .

mcapelle commented 6 years ago

@ElieArron1 I would be interested in a 10 customers sample of the (1) dataset to evaluate it. How can you send it to me? Thanks.

kjmckenzie commented 6 years ago

@ElieArron1

Can I also get a copy of the dataset?

Thanks!

ayushsingh244617 commented 6 years ago

can i get the dataset too?

qinbill commented 6 years ago

@ElieArron1 qinbill@gmail.com I am work on graph pattern mining in financial applications. Could you share me the data?

LPalazzo commented 6 years ago

@ElieArron1 can I also get a copy of it? I would like to use generalized linear methods in a time series framework to model transactions over time

ayushsingh244617 commented 6 years ago

I need it for educational purpose.im trying to study the auditing system in banks and for that I need a data set

On Feb 28, 2018 3:25 PM, "L" notifications@github.com wrote:

@ElieArron1 https://github.com/eliearron1 can I also get a copy of it? I would like to use generalized linear methods in a time series framework to model transactions over time

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-369186833, or mute the thread https://github.com/notifications/unsubscribe-auth/AhmAIjHY361hU7VR2wF-I3L8gI-yY1COks5tZSKLgaJpZM4JGVnn .

ctacampado commented 6 years ago

@ElieArron1 would also like to have a copy of your dataset to do prediction and classification for a prototype of a financial application. ct.acampado@gmail.com

leonardosapiras commented 6 years ago

@ElieArron1

Hi, I am doing some academic researches in the university where I teach. Could you share this dataset with sapiras@faccat.br ?

Thanks!

ayushsingh244617 commented 6 years ago

This is my email

On Feb 28, 2018 9:04 PM, "ElieArron1" notifications@github.com wrote:

@LPalazzo https://github.com/lpalazzo @ayushsingh244617 https://github.com/ayushsingh244617

Give me your emails we can discuss it

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-369276289, or mute the thread https://github.com/notifications/unsubscribe-auth/AhmAItTt7phnf457_srtGMkyZfUaEYA5ks5tZXGdgaJpZM4JGVnn .

thatdeep commented 6 years ago

@ElieArron1 Hi! I am starting a research on user behaviour modeling and graph-based approaches for finding customers with similar transaction patterns. Having real-life data would be good to play around and tune methods. Could you share your dataset with me too? My email is const.belev@gmail.com

LPalazzo commented 6 years ago

@ElieArron1 luc.palazzo@gmail.com

hyrahul commented 6 years ago

Hey, I don't have any kind of dataset that you are looking for why don't you google it about that !!!

Thanks

On Tue, Mar 6, 2018 at 3:44 PM, L notifications@github.com wrote:

@ElieArron1 https://github.com/eliearron1 luc.palazzo@gmail.com

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-370731829, or mute the thread https://github.com/notifications/unsubscribe-auth/Aix1m23xkX_1dhKD4erUjDTN6jM6kpodks5tbmFqgaJpZM4JGVnn .

aayush209 commented 6 years ago

@ElieArron1 Can you share the dataset with me , I need it for fraud detection using vizualisation . Email - aayushgupta1124@gmail.com

ayushsingh244617 commented 6 years ago

My email Id is ayushsingh244617@gmail.com

On Wed, Feb 28, 2018, 9:04 PM ElieArron1 notifications@github.com wrote:

@LPalazzo https://github.com/lpalazzo @ayushsingh244617 https://github.com/ayushsingh244617

Give me your emails we can discuss it

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-369276289, or mute the thread https://github.com/notifications/unsubscribe-auth/AhmAItTt7phnf457_srtGMkyZfUaEYA5ks5tZXGdgaJpZM4JGVnn .

Sergionethechamp commented 6 years ago

Hello @ElieArron1 I would love to have access to the dataset as well Conducting research and building model on spending patterns and credit relations My email is sergione.moronetti@gmail.com Happy to exchange in PM Thx!

csywchen commented 6 years ago

Hi @ElieArron1, I am conducting some research work on anomaly detection over customer transactions. Would you please share the dataset with me? My email is csywchen@gmail.com. Thanks a lot!

mcapelle commented 6 years ago

@ElieArron1 Hi, I would like to study expenses patterns thanks to that dataset. Can you send it to maxime.capelle@gmail.com?

Thanks.

alexjslessor commented 6 years ago

Hello I am trying to build fraud detection models using RNN's in tensorflow could I have it as well please? You can send it to alexjslessor@gmail.com

On Fri, Mar 16, 2018, 11:21 AM Maxime Capelle notifications@github.com wrote:

@ElieArron1 https://github.com/eliearron1 Hi, I would like to study expenses patterns thanks to that dataset. Can you send it to maxime.capelle@gmail.com?

Thanks.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/awesomedata/awesome-public-datasets/issues/234#issuecomment-373746675, or mute the thread https://github.com/notifications/unsubscribe-auth/AVuZy8Js0SffbnSPa8uc0kHf0AXbj9_Eks5te9iBgaJpZM4JGVnn .

theclanks commented 6 years ago

Hi @ElieArron1, I am starting a research on debt collector scores, could you share your dataset with me too? My email is otte@usp.br

Thanks.

Frank-Tsai commented 6 years ago

HI @ElieArron1 I'm currently working on research about churn prediction, could you please share a copy with me? Here's my email stu8978@gmail.com

Thanks a lot

matheusalagia commented 6 years ago

Hi @ElieArron1 I would love to have access to the dataset as well Conducting research and building model on spending patterns and credit relations My email is matheusalagia@gmail.com

Thanks a lot!

AlexWorldD commented 6 years ago

Hi, @ElieArron1 One of my current project is Bank's client classification and It'll be great if you can share your dataset with me for sandboxes modeling. And yep, MCC codes looks quite perspective in my view for that purpose. Thx

StephanieJoyMills commented 6 years ago

Hi, @ElieArron1 I was hoping I could also get access to the dataset. I'm trying to build a model on spending habits My email is mills.stephanie.j@gmail.com

Thanks :)

rjohn46 commented 6 years ago

Hi @ElieArron1 , would you mind sharing with me? I'm working on some spending behavior modeling. My email is richard@vinuio.com

Thanks very much!

Lideti commented 6 years ago

Hi @ElieArron1 , could you please share the with me as well? I'm a uni student and I'm designing a model for spending habits and future liquidity requirement. My email is mangaka2023@gmail.com. Please get in touch if you have any other concern. Thank you so much in advance!

EdoardoPaluan commented 6 years ago

Hi @ElieArron1 , It would be great if you could share the dataset with me as well, I am looking to create a classifier for financial transactions and mapping the dataset you have with the MCC reference codes I have would be very useful! My email is: edoardo.paluan@hotmail.com. Thank you so much!

saurabhk7 commented 6 years ago

@ElieArron1 Hi, could you share your dataset to saurabhkshirsagar35@gmail.com we plan to build an intelligent investment reccomendation system for customer benefit. Your dataset will be perfect for us. Thank You for your help.

shlomioved1234 commented 6 years ago

@ElieArron1 Hi, can you please send me your dataset? My email is shlomioved123@gmail.com. I am need of such data for educational reasons. I'm trying to study different banking transactions.

saravanan-thirumuruganathan commented 6 years ago

@ElieArron1 , can you send me your dataset at saravanan.thirumuruganathan@gmail.com . I am interested in analyzing it for fraud detection methodologies and product recommendation. Thanks a bunch!

khashishin commented 6 years ago

@ElieArron1 I would very grateful for this dataset. I'm currently trying to pursue my PhD connected with fraud detection and the use of behavioral profiling for lowering the fraud probabilities and banking transaction datasets would help me alot.

piotr.kaluzny@ue.poznan.pl

If you would share the dataset in the name of science improvement, I can't really express how grateful I would be.

ghost commented 6 years ago

hi @ElieArron1 I would be glad if you share with me too. I will use it in msd project about clustering similar entities on banking. (behaviorial similarity). blu3nx@hotmail.com thanks

danntapl commented 6 years ago

Hi @ElieArron1 , could you please send me your dataset? My email is danntapl@gmail.com. I'm trying to classify transactions. Thanks a lot

Imbasa-Solutions commented 6 years ago

Hi @ElieArron1, please can you also send me the dataset, I would like to perform some exploratory date analysis on it for academic purposes. Much appreciated. mathematicsnine(at)gmail(dot)com

mic0331 commented 6 years ago

Hello @ElieArron1, could you please also share this dataset with me - purpose is data exploration and machine learning - my email is "mic0331 AT gmail DOT com" - thanks very much

akando42 commented 6 years ago

Hello @ElieArron1 , could i get a copy of the dataset as well? We are working on a risk assessment model for business loan/ investment using bank transaction data, credit scrore and financial statements. my email is troy@topflightapps.com

bernardworthy commented 6 years ago

Hi @ElieArron1, I'm in the early phases of building a machine learning model that predicts financial health based on transaction data and your dataset would be perfect. Could you send to abworthy(at)gmail(dot)com? Thank you very much!

amirmoghadam93 commented 6 years ago

Dear @ElieArron1 , I'm working on my Master's thesis statement on the application of data mining for detection of money laundering suspicious cases. I was wondering if you could send me your dataset. My email: amir.moghadam@ut.ac.ir

naremanalazem commented 6 years ago

Hi @ElieArron1, I am starting a research one detection of money laundering in banks .., could you share your dataset with me too? My email is teacherit1992@gmail.con

Thanks.

saravanan-thirumuruganathan commented 6 years ago

Has any one gotten the data from @ElieArron1 ? I requested quite some time back and didnt receive any. If most of us have not received anything, we should probably stop asking!

gopalkalpande commented 6 years ago

I want to simulate the bank transaction for investment and deposits of customers. plz share the data set. email id : gopalkalpande@gmail.com

danntapl commented 6 years ago

Long time ago I asked @ElieArron1 and other from this link about the data, but I did not get answer.Maybe they are not following this section now, but if someone could share it , It would be greit. Regards!!!