Nonprofit-Open-Data-Collective / irs-efile-master-concordance-file

The Master Concordance File defines standards and provides documentation necessary to build structured databases from the IRS E-File XML files posted on AWS.
https://nonprofit-open-data-collective.github.io/irs-efile-master-concordance-file/
40 stars 6 forks source link

IRS Efile Master Concordance File

The Master Concordance File, available in CSV version in this repository, defines standards and provides documentation necessary to build structured databases from the IRS E-File XML files posted on AWS.

The IRS released the 990 E-FILER DATA as XML documents in 2016 with little documentation. The Master Concordance File (MCF) provides standards and conventions to assist the work of programmers that wish to utlize the data.

The MCF is meant to serve as a rosetta stone of sorts, allowing programmers to convert XML documents into a structured database by mapping 10,000 unique xpaths onto a consistent and well-documented data dictionary.

We designed consistent standards for variable naming conventions and documentation to improve ease of use.

The concordance organizes data into approximately 125 distinct tables that correspond with sections on the forms (approximately 80 one-to-one tables and 46 one-to-many tables).

DATA DICTIONARY

Also included here is an additional crosswalk for Part-01 of forms 990-EZ and the full 990. This section is somewhat distinct because many of the financial variables on Part-01 of the 990-EZ are not explicitly present on the full 990-PC, but they can be reconstructed (mostly) by creating composite variables by combining several fields.

FORM-990-EZ-PART-01-CROSSWALK.xlsx

Follow the MASTER CONCORDANCE FILE DOCUMENTATION link for an overview of informaton contained within the Master Concordance File.


DATA DICTIONARY FOR MASTER CONCORDANCE FILE

Note that the official form names are the "full" 990 (required for all public charities), the 990-EZ (for small public charities), and the 990-PF (for private foundations). In the documentation we refer to the full 990 as the 990-PC for "public charity" (as opposed to the 990-PF for private foundations). That provides for additional consistency in formatting and prevents confusion about whether we mean the full or EZ version when referencing "form 990".

The efiler concordance.csv included in this repository consists of the following variables:

VARIABLE DEFINITIONS

LOCATION OF THE FIELD ON THE 990 FORM

Note that location codes were designed so sorting by location codes in the spreadsheet will place variables in the same order as they appear on the full version of the 990-PC or schedules.

Fields often appear in tables with letters indexing columns and line numbers indexing rows. In these cases location codes will sort from left to right, then top to bottom.

F990-PC-PART-07-SECTION-B-LINE-01-COL-A
F990-PC-PART-07-SECTION-B-LINE-01-COL-C
F990-PC-PART-07-SECTION-B-LINE-01-COL-B

TABLE INFO

XML SCHEMA INFO (from XSD files)

A more in-depth description of each variable is available HERE.


FORM 990 and SCHEDULE SECTIONS

*These groups are based upon the 2016 IRS 990 Forms and Schedules.

FORM 990-PC (FULL)

SCHEDULE A - PUBLIC CHARITY STATUS AND PUBLIC SUPPORT (PC AND EZ FILERS)

Complete if the organization is a section 501(c)(3) organization or a section 4947(a)(1) nonexempt charitable trust.

SCHEDULE B - SCHEDULE OF CONTRIBUTORS (PC, EZ, AND PF FILERS)

SCHEDULE C - POLITICAL CAMPAIGN AND LOBBYING ACTIVITIES (PC AND EZ FILERS)

SCHEDULE D - SUPPLEMENTAL FINANCIAL STATEMENTS (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, line 6, 7, 8, 9, 10, 11a, 11b, 11c, 11d, 11e, 11f, 12a, or 12b.

SCHEDULE E - SCHOOLS (PC AND EZ FILERS)

Complete if the organization answered “Yes” on Form 990, Part IV, line 13, or Form 990-EZ, Part VI, line 48.

SCHEDULE F - STATEMENT OF ACTIVITIES OUTSIDE THE UNITED STATES (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, line 14b, 15, or 16.

SCHEDULE G - SUPPLEMENTAL INFORMATION REGARDING FUNDRAISING OR GAMING ACTIVITIES (PC AND EZ FILERS)

Complete if the organization answered “Yes” on Form 990, Part IV, line 17, 18, or 19, or if the organization entered more than $15,000 on Form 990-EZ, line 6a.

SCHEDULE H - HOSPITALS (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, question 20.

SCHEDULE I - GRANTS AND OTHER ASSISTANCE TO ORGANIZATIONS, GOVERNMENTS, AND INDIVIDUALS IN THE UNITED STATES (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, line 21 or 22.

SCHEDULE I1 - Continuation Sheet for Schedule I (Form 990)

SCHEDULE J - COMPENSATION INFORMATION (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, line 23

SCHEDULE J1 - CONTINUATION SHEET FOR SCHEDULE J

SCHEDULE J2 - CONTINUATION SHEET FOR FORM 990

Attach to Form 990 to list additional information for Form 990, Part VII, Section A, line 1a.

SCHEDULE K - SUPPLEMENTAL INFORMATION ON TAX-EXEMPT BONDS (PC FILERS ONLY)

Complete if the organization answered “Yes” on Form 990, Part IV, line 24a. Provide descriptions, explanations, and any additional information in Part VI.

SCHEDULE L - TRANSACTIONS WITH INTERESTED PERSONS (PC AND EZ FILERS)

Complete if the organization answered “Yes” on Form 990, Part IV, line 25a, 25b, 26, 27, 28a, 28b, or 28c, or Form 990-EZ, Part V, line 38a or 40b.

SCHEDULE M - NONCASH CONTRIBUTIONS (PC FILERS ONLY)

Complete if the organizations answered “Yes” on Form 990, Part IV, lines 29 or 30.

SCHEDULE N - LIQUIDATION, TERMINATION, DISSOLUTION, OR SIGNIFICANT DISPOSITION OF ASSETS (PC AND EZ FILERS)

Complete if the organization answered "Yes" on Form 990, Part IV, lines 31 or 32; or Form 990-EZ, line 36.

SCHEDULE O - SUPPLEMENTAL INFORMATION TO FORM 990 OR 990-EZ

SCHEDULE R - RELATED ORGANIZATIONS AND UNRELATED PARTNERSHIPS (PC FILERS ONLY)

Complete if the organization answered "Yes" on Form 990, Part IV, line 33, 34, 35b, 36, or 37.