It comes in two formats: One is a text document with with column size restrictions that makes it very hard to read, worse than the text version of an IETF RFC. The second is a machine readable XML document which itself isn't easily read.
Are there any good tools for viewing these? I did find GovTrack.us but it seems to be down so I'm not sure if it solves this problem.
[1]: https://www.congress.gov/bill/119th-congress/house-bill/1/text
OP may have been unlucky on the timing. The site isn't usually down. Here's the link to the text of H.R. 1 on GovTrack: https://www.govtrack.us/congress/bills/119/hr1/text
We automatically add links to U.S. Code and other citations. In this case Congress.gov is missing rich formatting which we have (I'm not sure why they are missing it for this bill, normally they have it). GovTrack also allows making diff-like comparisons between bill versions and between bills (for example, you can see the last-minute changes made ahead of the vote on this bill).
Source code is available on GitHub if anyone wants to try making GovTrack better, although it's quite complicated because Congressional information is complicated and there's no real money behind this: https://github.com/govtrack/govtrack.us-web/
If anyone has particular thoughts on what would be helpful when viewing bill text --- within the realm of the information that is actually freely available --- I am all ears.
There's excellent documentation on the formats and how to access all the data.
Except that it's a government thing so the parser's probably not going to be little. :)
Edit: The thing's basically XHTML without any kind of header. UTF-8 encoding, it looks like. So a conversion tool would just need to wrap it up and add styling.
Edit: Despite hints that it's XHTML, it's not valid XHTML.
Edit: Stick this at the top of the file:
--------------------- 8< ---------------------
H. R. 1
--------------------- 8< ---------------------
And add this to the bottom of the file:
--------------------- 8< ---------------------
--------------------- 8< ---------------------
I'll leave it as an exercise to the reader to write a script to do that. Automatically extracting the bill title should be Fun.
seems to be broken on the "Big Beautiful Bill" right now though :(, I'm taking a a look to see what's going on
https://www.congress.gov/119/bills/hr1/generated/BILLS-119hr...