LibraryOfCongress / api.congress.gov

congress.gov API
601 stars 38 forks source link

(daily-congressional-record): `entireIssue` with `"type": "Formatted Text"` does not include entire text #220

Open ryparker opened 3 months ago

ryparker commented 3 months ago

When requesting a record from the daily-congressional-record API, links to the "Formatted Text" version of entireIssue do not include all of the text from the record.

e.g.

curl --location 'https://api.congress.gov/v3/daily-congressional-record/170/13?format=json&api_key=<API_KEY>'

Response:

{
    "issue": {
        "congress": 118,
        "fullIssue": {
            "articles": {
                "count": 59,
                "url": "https://api.congress.gov/v3/daily-congressional-record/170/13/articles?format=json"
            },
            "entireIssue": [
                {
                    "part": "1",
                    "type": "PDF",
                    "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24.pdf"
                },
                {
                    "part": "1",
                    "type": "Formatted Text",
                    "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-pt1-PgD69.htm"
                }
            ],
            "sections": [
                {
                    "endPage": "D72",
                    "name": "Daily Digest",
                    "startPage": "D69",
                    "text": [
                        {
                            "type": "PDF",
                            "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-dailydigest.pdf"
                        }
                    ]
                },
                {
                    "endPage": "E69",
                    "name": "Extensions of Remarks Section",
                    "startPage": "E69",
                    "text": [
                        {
                            "part": "1",
                            "type": "PDF",
                            "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-extensions.pdf"
                        }
                    ]
                },
                {
                    "endPage": "S253",
                    "name": "Senate Section",
                    "startPage": "S227",
                    "text": [
                        {
                            "part": "1",
                            "type": "PDF",
                            "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-senate.pdf"
                        }
                    ]
                }
            ]
        },
        "issueDate": "2024-01-24T05:00:00Z",
        "issueNumber": "13",
        "sessionNumber": 2,
        "updateDate": "2024-01-25T10:45:11Z",
        "url": "https://api.congress.gov/v3/daily-congressional-record/170/13?format=json",
        "volumeNumber": 170
    },
    "request": {
        "contentType": "application/json",
        "format": "json",
        "issueNumber": "13",
        "volumeNumber": "170"
    }
}

Find the fullIssue.entireIssue links:

"entireIssue": [
                {
                    "part": "1",
                    "type": "PDF",
                    "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24.pdf"
                },
                {
                    "part": "1",
                    "type": "Formatted Text",
                    "url": "https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-pt1-PgD69.htm"
                }
            ],

Notice how the formatted text URL includes a page/section number (-PgD69): https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-pt1-PgD69.htm

Download the "Formatted Text" version:

curl 'https://www.congress.gov/118/crec/2024/01/24/170/13/CREC-2024-01-24-pt1-PgD69.htm' -o '170-13-entire-issue.html'

Notice how the response is only a portion of the daily digest. I had expected this to include everything from the issue, including the full daily digest.

<html> <head> <title>Congressional Record, Volume 170 Issue 13 (Wednesday, January 24, 2024)</title> </head> <body><pre> [Congressional Record Volume 170, Number 13 (Wednesday, January 24, 2024)] [Daily Digest] [Pages D69-D70] From the Congressional Record Online through the Government Publishing Office [<a href="https://www.gpo.gov">www.gpo.gov</a>] Wednesday, January 24, 2024 [[Page D69]] Daily Digest Senate Chamber Action Routine Proceedings, pages S227-S253 Measures Introduced: Eleven bills and one resolution were introduced, as follows: S. 3647-3657, and S. Res. 531. Pages S250-51 Measures Passed: Train More Nurses Act: Committee on Health, Education, Labor, and Pensions was discharged from further consideration of S. 2853, to require the Secretary of Health and Human Services and the Secretary of Labor to conduct a study and issue a report on grant programs to support the nursing workforce, and the bill was then passed. Page S253 Appointments: Washington's Farewell Address: The Chair, on behalf of the Vice President, pursuant to the order of the Senate of January 24, 1901, as modified by the order of January 24, 2024, appointed Senator Cardin to read Washington's Farewell Address on Monday, February 26, 2024. Page S253 Washington's Farewell Address--Agreement: A unanimous-consent agreement was reached providing that notwithstanding the order of January 24, 1901, the traditional reading of Washington's Farewell Address take place on Monday, February 26, 2024, following the Prayer and Pledge of the Flag. Page S253 Message from the President: Senate received the following message from the President of the United States: Transmitting, pursuant to the Constitution, the report of the veto of S.J. Res. 38, a joint resolution that would disapprove under chapter 8 of title 5, United States Code, an action by the Federal Highway Administration relating to ``Waiver of Buy America Requirements for Electric Vehicle Chargers''; ordered to be printed in the Record, spread in full upon the Journal, and held at the desk. (PM-35) Page S248 =========================== NOTE =========================== On page D69, January 24, 2024, the following language appears: Message from the President: Senate received the following message from the President of the United States: Transmitting, pursuant to the Constitution, the report of the veto of S.J. Res. 32, a joint resolution that would disapprove under chapter 8 of title 5, United States Code, an action by the Federal Highway Administration relating to ``Waiver of Buy America Requirements for Electric Vehicle Chargers''; ordered to be printed in the Record, spread in full upon the Journal, and held at the desk. (PM-35) The online Record has been corrected to read: Message from the President: Senate received the following message from the President of the United States: Transmitting, pursuant to the Constitution, the report of the veto of S.J. Res. 38, a joint resolution that would disapprove under chapter 8 of title 5, United States Code, an action by the Federal Highway Administration relating to ``Waiver of Buy America Requirements for Electric Vehicle Chargers''; ordered to be printed in the Record, spread in full upon the Journal, and held at the desk. (PM-35) ========================= END NOTE ========================= Lund Nomination--Agreement: Senate resumed consideration of the nomination of Gretchen S. Lund, of Indiana, to be United States District Judge for the Northern District of Indiana. Pages S236-46 A unanimous-consent agreement was reached providing that the motion to invoke cloture with respect to the nomination, be withdrawn, and that, notwithstanding Rule XXII, Senate vote on confirmation of the nomination at 11:30 a.m., on Thursday, January 25, 2024. Page S253 A unanimous-consent agreement was reached providing for further consideration of the nomination at approximately 10 a.m., on Thursday, January 25, 2024; that the motion to invoke cloture on the nomination of Kirk Edward Sherriff, of California, to be United States District Judge for the Eastern District of California, ripen upon disposition of the nomination of Gretchen S. Lund, and that if cloture is invoked on the nomination of Kirk Edward Sherriff, all time be considered expired and the vote on confirmation of the nomination be at a time to be determined by the Majority Leader, in consultation with the Republican Leader; that following the cloture vote on the nomination of Kirk Edward Sherriff, Senate resume consideration of the nomination of Joshua Paul Kolar, of Indiana, to be United States Circuit Judge for the Seventh Circuit, and that the motion to invoke cloture on the nomination ripen at 1:45 p.m. Page S253 Nominations Confirmed: Senate confirmed the following nominations: By 80 yeas to 17 nays (Vote No. EX. 20), Jacquelyn D. Austin, of South Carolina, to be United States District Judge for the District of South Carolina. Page S235 During consideration of this nomination today, Senate also took the following action: By 79 yeas to 17 nays (Vote No. EX. 18), Senate agreed to the motion to close further debate on the nomination. Page S232 By 67 yeas to 32 nays (Vote No. EX. 21), Cristal C. Brisco, of Indiana, to be United States District Judge for the Northern District of Indiana. Page S236 During consideration of this nomination today, Senate also took the following action: By 68 yeas to 29 nays (Vote No. EX. 19), Senate agreed to the motion to close further debate on the nomination. Page S233 [[Page D70]] Messages from the House: Bills Presented: e Communications:30Executive Reports of Committees: al Cosponsors:30Statements on Introduced Bills/Resolutions: al Statements: Authorities for Committees to Meet:30Record Votes: Four record votes were taken today. (Total--21) 30Record VPages S232-33, S235-36 Adjournment: Senate convened at 10 a.m. and adjourned at 7:10 p.m., until 10 a.m. on Thursday, January 25, 2024. (For Senate's program, see the remarks of the Majority Leader in today's Record on page S253.) </pre></body> </html>
apreiter18 commented 3 months ago

Thanks, @ryparker - I am going to investigate this and create a ticket.