As a part of the Master's thesis at the University of Amsterdam and ITMO University.
Keywords – REPRESENTATION LEARNING, FINANCIAL STATEMENTS, NETWORKS, AUDIT, SAMPLING STRATEGY, SKIP-GRAM MODEL, TRANSACTION DATA
The solution relies on both modelling techniques and machine learning. We give a detail definition of sampling strategy, finWalk on a Financial statement network. The novelty of it is to follow directions of relationships on the network rather than directions of edges. As a result, after learning embeddings, one allows merging a large number of business processes into groups as well as revealing an actual meaning of these groups.
In the experiments, we demonstrate the results of applying our coarse-graining procedure to simulated. Moreover, we establish the fact that plausible relationship models considering the predicted labels have the same order of accuracy as the models operating with expert labels. Owing to the framework for data simulation (Simulation folder), we ensure the repeatability of our findings and encourage further investigation and improvements.