ihmeuw / pseudopeople

pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in testing entity resolution (record linkage) methods or other data science algorithms at scale.
https://pseudopeople.readthedocs.io
BSD 3-Clause "New" or "Revised" License
20 stars 2 forks source link

Working with duplicates of individuals #455

Closed BrianJSteven closed 2 months ago

BrianJSteven commented 2 months ago

What is the name of your project?

Duplicate Identification

What is the purpose of your project?

We hope to be able to identify duplicates of individuals in very large dataframes.

Who is involved in the project? Which of these people will have direct access to the pseudopeople input data?

Both myself, Anna Grace and John G will have access to the data.

What funding is the project under? What expectations with respect to open access and access to data come with that funding?

The project is part of the Melton Scholars at the University of Tennessee Knoxville.

We commit to:

What data would you like to request?

Other data - more explanation

No response

aflaxman commented 2 months ago

Am I correct to think of this as the same project as the one from #454?

BrianJSteven commented 2 months ago

That is correct!


From: Abraham Flaxman @.> Sent: Saturday, September 7, 2024 3:56 PM To: ihmeuw/pseudopeople @.> Cc: Stevens, Brian @.>; Author @.> Subject: Re: [ihmeuw/pseudopeople] Working with duplicates of individuals (Issue #455)

Am I correct to think of this as the same project as the one from #454https://github.com/ihmeuw/pseudopeople/issues/454?

— Reply to this email directly, view it on GitHubhttps://github.com/ihmeuw/pseudopeople/issues/455#issuecomment-2336424382, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AFUM7T5LHXYUTNEYHFZJ3B3ZVNK73AVCNFSM6AAAAABNZQJPJ2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZWGQZDIMZYGI. You are receiving this because you authored the thread.Message ID: @.***>

Ironholds commented 2 months ago

Awesome! I am going to close this and redirect it to #454 since that has much more detail on the application :)