Pull Request for #42: Improve LaTeX builder

rcalvo12 commented 2 years ago

@jmshapir and @veli-m-andirin could you review this PR when you get a chance?

In this issue, we made changes to improve the LaTeX builder so that it could produce papers from LaTeX files that use BibTeX for references.

Changes:

Edits to build_latex.py
Addition of Template.tex and References.bib to source/paper
Addition of gdp_educ.tex and top_gdp.tex to be referenced by Template
Edits Template.lyx to use References.bib for references
Edits to source/paper/sconscript

For this PR, I think it would work if:

[ ] @veli-m-andirin and @jmshapir review the code and repo changes
[ ] Someone (@veli-m-andirin ?) double checks that test_build_latex.py does not need edits
[ ] @rcalvo12 removes the issue folder and closes the pull

@mcaceresb, if you want to review as well, I can add you as a reviewer.

rcalvo12 commented 2 years ago

Given that all the tests failed/were cancelled, it looks like editing test_build_latex.py will be necessary.

mcaceresb commented 2 years ago

@rcalvo12 Yes, thanks! Will have a look later today. (PS: I can probably help with the failing tests too, given they're failing on Ubuntu and I'm on Linux; ofc feel free to debug in the meantime.)

mcaceresb commented 2 years ago

@rcalvo12 Did you run the tests locally? This doesn't seem to be specific to Linux; the tests as they are ought to fail generally. Just want to check this is the case.

Tagging @jmshapir since I think some of these overlap with how this builder ought to be designed. Some wrinkles I've found so far:

The BibTeX call is evaluated with the same tests as the LaTeX call. The LaTeX tests expect the command-line call to be formatted differently than BibTeX.
There are no BibTeX-specific tests. I see there was a short back and forth in #42 about this, but I'd argue the existing tests are not testing the BibTeX addition. I think we might want to check that the function works: Without a .bib file, with a .bib file, that BibTeX is run on the right target (i.e. build and not source), and that .bbl/.blg files are created if there is a .bib file as input.
There is no BibTeX-specific executable. This is somewhat of a corner case so it may not be necessary (it would only serve to allow users to specify a custom executable, like a custom path or biber).
The tests are designed so multiple calls to subprocess.check_output will fail. I believe the idea behind is is to make sure the builder is invoked only once, so that we can see the tests being run correspond to a single call to the builder. However, in this case the builder makes 4 calls. It should be possible to make multi-line calls but this may not be the best design.

I'll make code-specific comments (if there are any) in my review. LMK if I should help with any of the above before then.

veli-m-andirin commented 2 years ago

Thanks @rcalvo12 for your persistent work here!

I think all my in-line comments are addressed (conditional on @mcaceresb's confirmation where I tagged him). I think it might be good to focus on the few broader points in https://github.com/JMSLab/Template/pull/56#issuecomment-1086312640 and https://github.com/JMSLab/Template/pull/56#pullrequestreview-929480958.

I also think it might be a good idea to take @jmshapir's and @mcaceresb's opinion about the following point in https://github.com/JMSLab/Template/pull/56#pullrequestreview-929480958:

I wonder how we'd like to deal with cases where we have additional files that have references to the main text, such as an online Appendix. In that case, In these cases the order in which we compile the files would matter.

I am curious if we should have a builder that accounts for these cases. In theory, we can have cross-references across many files, and I am not sure about the architecture of a builder that can allow for an arbitrary number of auxiliary files such as online appendices. If we think that we should aim to account for the "most usual structure", then things might get simpler. If the most common structures are:

A document with no appendices.
A document with one appendix.

then a conditional logic to cover these would probably be simple. If we agree to have appendices in the same file as the main text as a lab practice, then we won't even need a conditional logic.

jmshapir commented 2 years ago

Thanks @veli-m-andirin! Regarding the issue of multiple TeX files that you highlight in https://github.com/JMSLab/Template/pull/56#issuecomment-1099613997, my suggestion is to leave this aside for the purpose of this pull.

If no one has a current use case, we can perhaps add it to the roadmap as a feature request.

If, now or later, either we in the lab or one of the developers has a use case, we can open an issue to tackle it. (I suspect there may be a way to handle arbitrary numbers of files but we might need to add a metadata file that tells the builder the order in which to compile. And we could always start with the simplest case as you say.)

Does that sound good?

veli-m-andirin commented 2 years ago

Thanks @jmshapir! This sounds good to me. To be precise, the use case I had in mind was based on EventStudy where we had an appendix that cross-referenced the main text where the order of compilation mattered. That being said, I agree that having a builder that can compile a single file with a bibliography correctly would be a good first step.

rcalvo12 commented 2 years ago

If, now or later, either we in the lab or one of the developers has a use case, we can open an issue to tackle it. (I suspect there may be a way to handle arbitrary numbers of files but we might need to add a metadata file that tells the builder the order in which to compile. And we could always start with the simplest case as you say.)

To be precise, the use case I had in mind was based on EventStudy where we had an appendix that cross-referenced the main text where the order of compilation mattered. That being said, I agree that having a builder that can compile a single file with a bibliography correctly would be a good first step.

@jmshapir I would be fine either way we do this. I can take a look at a simple appendix cross-reference case and see if I can come up with a solution given the EventStudy use case @veli-m-andirin pointed out (Maybe budget no more than a couple of hours to look at it). I am also fine leaving this for another issue down the road.

jmshapir commented 2 years ago

Thanks @veli-m-andirin @rcalvo12!

My instinct is to leave the multi-TeX-file case for another issue.

mcaceresb commented 2 years ago

@jmshapir Should tests be tackled in this PR or a later issue? If the latter, two notes:

The LaTeX tests clash with the BibTeX addition, but this is probably fine to tackle later.
There are no BibTeX specific tests. I think it's probably fine do this later as well so long as the builder has been tested non-formally.

LMK. My code comments are addressed but not the test comments I made here. However, if the tests are for a follow-up I'll approve as well.

@rcalvo12 @veli-m-andirin fyi

jmshapir commented 2 years ago

@jmshapir Should tests be tackled in this PR or a later issue?

@mcaceresb thanks. My instinct is that, before we merge this pull, it's probably good to have at least some minimal working tests for the functionality we've implemented here, including TeX+BibTeX. Let me know if that answers or if we should make a more fine-grained decision.

@rcalvo12 fyi.

mcaceresb commented 2 years ago

@jmshapir Should tests be tackled in this PR or a later issue?

@mcaceresb thanks. My instinct is that, before we merge this pull, it's probably good to have at least some minimal working tests for the functionality we've implemented here, including TeX+BibTeX. Let me know if that answers or if we should make a more fine-grained decision.

@rcalvo12 fyi.

@jmshapir Sounds good.

@rcalvo12 If you know how to implement the above great; otherwise I am happy to go into more detail about how the tests here are structured.

rcalvo12 commented 2 years ago

@rcalvo12 If you know how to implement the above great; otherwise I am happy to go into more detail about how the tests here are structured.

@mcaceresb I spent some time looking over it, and I think some more detail about how the tests are structured would be extremely useful. Thanks!

mcaceresb commented 2 years ago

@rcalvo12

How the tests are structured

The tests are based on the unittest module, which you cans see as we are basing the test classes on unittest.TestCase.
In general to add a test we define a function named def test_... with the structure:
```
@[patch]
def test_[test](self, mock_[mock]):
    # test code
```

This much I reckon is clear from reading the code, but I think figuring out what is happening beyond this becomes difficult. Let me explain this by dissecting the parts of the first LaTeX test:

@subprocess_patch
def test_default(self, mock_check_output):
    '''
    Test that build_latex() behaves correctly when provided with
    standard inputs.
    '''
    mock_check_output.side_effect = fx.latex_side_effect
    target = 'build/latex.pdf'
    helpers.standard_test(self, build_latex, 'tex',
                          system_mock = mock_check_output,
                          source = ['test_script.tex'],
                          target = target)
    self.assertTrue(os.path.isfile(target))

Why do all tests have a patch decorator and mock output? The first key thing to know is that we can only test the python code with the automated unit tests. In other words, we cannot in general run the entire repository (proprietary software such as Matlab and Stata is not available on-demand in gibhut servers; while we could run LaTeX, LyX, and R tests, since they are open source, for now we are only testing the python code itself).

The solution is to "pretend" that we are running them without actually invoking the command-line call. The @ decorator patches the system call so that it does not actually get invoked by the system; instead it gets invoked in a pretend testing environment. The mock function is to generate pretend output so we can check whether the python code is doing what it's supposed to (this gets generated in lieu of actual output).
@subprocess_patch in this case patches the calls to check_output made by the builder and replaces them with mock calls.
mock_check_output produces mock output based on a side_effect; these are program-specific and defined in _side_effects.py.
The rest of the code is meant to check a particular aspect of the builder works as expected. In this case, standard_test is run, which is defined in _test_helpers.py. This calls the builder, redirects the log file, and runs the side-effect.
Finally, we check what we want. In this case, we check the target file was created (this happens in the side-effect, which parses the function call using a pre-defined regex; see commandd_match() for details).

High-level issues

These should be resolved before coding any new tests:

At the moment the standard tests call system_mock.assert_called_once() to check that the builder was only called once (since this is standard for all programs except LaTeX). We need to decide whether to make a LaTeX-specific exception or collapse the three LaTeX calls to a single check_output call.
- My vote is to make a LaTeX-specific exception; we'd need to figure out how to have system_mock assert it was called exactly thrice for .tex files. Unsure how to do this but I assume it is possible. LMK if you do not find it in the docs.
- While it's definitely possible to do a single check_output call for LaTeX with os-specific switches, the main drawback is that we'd have to modify the LaTeX regex, which I don't think is a good idea. This would be the case even if we found an OS-independent way to do it I think (unless check_output has a "repeat command" switch),
BibTeX cannot be checked with the current LeTeX tests for two reasons:
- The LaTeX regex could not match the BibTeX command.
- check_output would be called four times instead of three with BibTeX.

Next steps

This is my own take:

Figure out how to make the LaTeX tests pass. This should require only adding the switch specific to LaTeX to assert three calls to check_output.
Decide how to code the BibTeX tests. There are two choices:
- Have a BibTeX-specific file.
- Add it to the LaTeX tests.
My vote is for the latter. In either case we need to code a BibTeX side-effect. For this step, it will be sufficient to code an empty BibTeX side-effeect and add logic in the LaTeX side-effect to switch to BibTeX when this is called instead of LaTeX. LMK if you have trouble doing this (it's fine for the side-effect to be empty since none of the LaTeX tests should call it).

Once that is done, if both are correct, we can:

Add a single BibTeX test, whcih just checks the function calls are as expected. This is the hardest step.
- You have to now code a BibTeX side effect that creates a .blg and .bbl file based on the BibTeX command.
- The BibTeX command has to be parsed by a BibTeX-specific regex, which you will have to code an add to commandd_match.
- There should be another switch so check_output is asserted to have been called 4 times (not 3).
- I can also explain the side-effect in detail when you get to this step.

I think this will be plenty for this issue. You can add a test_bibtex_basic function to the LaTeX tests and add a .bib file to the dependenties, which should activate the BibTeX. This tests:

That the builder without .bib is correct (the rest of the LaTeX tests would do this).
The builder with .bib is correct (this new BibTeX tests would do this).

The other two tests to add would be:

Test multiple .bib files.
Test the output having a different name than the input.

But I think 1-3 above is already plenty for this PR.

rcalvo12 commented 2 years ago

@mcaceresb Thank you for the incredibly useful summary above. I tried my hand at implementing the first step so that it asserts 3 calls if latex is specified. Any feedback would be appreciated.

mcaceresb commented 2 years ago

@mcaceresb Thank you for the incredibly useful summary above. I tried my hand at implementing the first step so that it asserts 3 calls if latex is specified. Any feedback would be appreciated.

@rcalvo12 Nice! The only issue that remained was that LaTeX cleanup is invoked on failure and expects add_out_name to have been run, so I switched it to L32 (above execute_system_call). You can see all the tests are now passing. This should always be the case now (you can run tests locally using pytest).

Happy to continue to help as you make progress on coding the BibTeX test.

rcalvo12 commented 2 years ago

@mcaceresb Thanks again for your help here! I'm trying to work on this step now:

In either case we need to code a BibTeX side-effect. For this step, it will be sufficient to code an empty BibTeX side-effect and add logic in the LaTeX side-effect to switch to BibTeX when this is called instead of LaTeX. LMK if you have trouble doing this (it's fine for the side-effect to be empty since none of the LaTeX tests should call it).

I was able to add an empty side-effect but I am unclear on how to add the logic within the LaTeX side effect for switching. Maybe you can expand on this a bit more?

mcaceresb commented 2 years ago

I was able to add an empty side-effect but I am unclear on how to add the logic within the LaTeX side effect for switching. Maybe you can expand on this a bit more?

@rcalvo12 You can write a bibtex regex in command_match(), then do something like

    bibmatch = helpers.command_match(command, 'bibtex')
    if bibmatch:
        bibtex_side_effect(*args, **kwargs)
        return

    match = helpers.command_match(command, 'pdflatex')
    #  rest of LaTeX side-effect

Happy to also give guidance on writing the regex if you need it! LMK.

rcalvo12 commented 2 years ago

@mcaceresb I tried my hand at what you suggested here, would appreciate feedback. Thanks!

mcaceresb commented 2 years ago

@rcalvo12 One excellent resource is regex101. You can input your regex and sample text to see if it matches. If you select python as the flavor, you can see

(?P<executable>\w+)\s+(?P<target>-\w+\s+\S+)?\s*(?P<log_redirect>\>\s*[\.\/\\\w]+\.\w+)?

does not quite partition

bibtex target_file > sconscript.log

correctly. While the expression matches, the groups are not divided in the way we want. You can see the issue is with target, which was based on a regex for an option rather than a source or target. (Note other options for sources/targets assume a file extension.) You can play around with the website, or see what worked for me:

spoiler

``` (?P\w+)\s+(?P[\.\/\\\w]+)?\s*(?P\>\s*[\.\/\\\w]+\.\w+)? ``` (This accounts for slashes in the file name as well. Note you'd have to put this back into the format in `_test_helpers.py`.)

PS: I do realize I'm giving you feedback in a very piece-meal way. I thought it would be helpful but LMK in case you want me to more actively help in finishing to code this: I don't want to prompt you to spend too much time on it if it's not helpful for you.

rcalvo12 commented 2 years ago

@mcaceresb After spending some time on this issue over the past handful of weeks, I am not much closer on the implementation of the bibtex test and I think it would probably be more efficient to bring you in on the implementation. Maybe we can set up a call to discuss, so I can learn from you more directly on this.

mcaceresb commented 2 years ago

@rcalvo12 Messaged you; we'll set something up and hopefully close this PR.

@jmshapir fyi

mcaceresb commented 2 years ago

@rcalvo12 Following our meeting, I pushed the changes we discussed implementing:

The mock bibtex test.
The nsyscalls argument to standard_test to account for LaTeX and bibtex running the builder multiple times.
An assertion that the check_bib function works when .bib files are specified.

The automated python tests are all passing and I've approved the PR.

@jmshapir fyi

rcalvo12 commented 2 years ago

Thanks @mcaceresb for that very productive meeting! In https://github.com/JMSLab/Template/pull/56/commits/58010a8070f96f59c224c82cb75b2f57f39fc443, I reran the code and tests on my end and I also added latex to the list of envs in Sconstruct. Everything appears to be working as intended.

@jmshapir fyi.

rcalvo12 commented 2 years ago

Removed issue sub in https://github.com/JMSLab/Template/pull/56/commits/537fcfd63c7dd91f7b0ec4345c3c3bae428c2984.

rcalvo12 commented 2 years ago

@veli-m-andirin Did you have any other changes here? If not can you approve?

veli-m-andirin commented 1 year ago

@rcalvo12, feel free to ignore the comment about the pdf for talk, it seems to be done on purpose!

jmshapir commented 1 year ago

@rcalvo12, feel free to ignore the comment about the pdf for talk, it seems to be done on purpose!

Thanks @veli-m-andirin! See https://github.com/JMSLab/Template/issues/66#issue-1378677265 for context.

rcalvo12 commented 1 year ago

Summary in https://github.com/JMSLab/Template/issues/42#issuecomment-1256343749.

JMSLab / Template