New analysis testing utilities by dwhswenson · Pull Request #1068 · openpathsampling/openpathsampling

dwhswenson · 2021-09-10T13:30:54Z

Writing tests for analysis code is hard. I always tell people that the input to most analysis methods should be a list of steps. But how do you mock up a sequence of steps?

This PR adds some utilities in tests/analysis/utils/ to help create mock simulation results. These utilities have their own test in that same directory.

This was developed because the headache of writing such tests has been blocking both #1026 and #1048 for months. I decided to make a separate task out of creating tools to solve that generic problem.

The approach used here is to use as much of the internal OPS machinery as possible, instead of just reproducing the data it generates (as was done in the OPSPiggybacker). Here, we make use of unittest.mock.patch to inject the data that OPS would have created via simulation. This means that if the internal structure of movers (or of the Details they return) changes, this code should automatically use that new behavior.

The scope of this PR is limited to supporting one-way shooting moves, path reversal moves, and repex moves, as well as wrapping them the way an OrganizeByMoveGroup global strategy would. Support for other movers will be added as needed in the future.
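
To illustrate the patching idea described above, here is a minimal sketch. `ToyShootingMover` and `run_with_forced_index` are hypothetical names invented for this example and are not part of OPS; only `unittest.mock.patch` and `random.randrange` are real standard-library calls. The point is that the mover's own code still runs, and only the "random" choice is injected.

```python
import random
from unittest.mock import patch


class ToyShootingMover:
    """Hypothetical stand-in for a path mover; NOT an OPS class."""

    def move(self, trajectory):
        # The mover's own logic stays in place: it picks a shooting
        # point at random and reports what it chose.
        index = random.randrange(len(trajectory))
        return {"shooting_index": index,
                "shooting_point": trajectory[index]}


def run_with_forced_index(mover, trajectory, index):
    """Run the mover's real code while dictating its 'random' choice."""
    # Inside the context manager, random.randrange is replaced by a mock
    # that always returns `index`; it is restored on exit.
    with patch("random.randrange", return_value=index):
        return mover.move(trajectory)


details = run_with_forced_index(ToyShootingMover(), [0, 1, 2, 3, 2, 1, 0], 3)
```

Because the result is assembled by the mover itself, a change to what the mover returns would show up in the mocked result without touching the test helper.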

codecov · 2021-09-12T15:59:49Z

Codecov Report

Merging #1068 (bf098d3) into master (9c525b1) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #1068   +/-   ##
=======================================
  Coverage   81.56%   81.56%           
=======================================
  Files         140      140           
  Lines       15452    15452           
=======================================
  Hits        12604    12604           
  Misses       2848     2848

dwhswenson · 2021-09-12T19:11:48Z

This is ready for review and comment. I will leave it open for at least 48 hours, merging no earlier than Tue 14 Sep 20:00 GMT (16:00 my local).

Coverage doesn't show, since the utils code is inside the tests directory. However, local tests showed coverage for everything except the ImportError that preserves Py27 compatibility.

For what it's worth, I'm already playing with this in a branch where I've mixed this stuff and #1026. Writing tests for #1026 is so easy now.
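
The uncovered ImportError branch mentioned above is presumably a guarded import of the mock library. The following is a sketch of that common pattern, assuming the third-party `mock` backport is what provides Python 2.7 support; the actual import in the PR may differ.

```python
try:
    # Python 3: mock ships with the standard library.
    from unittest import mock
except ImportError:
    # Python 2.7 only: fall back to the third-party `mock` backport.
    # A coverage run on Python 3 can never execute this branch.
    import mock

patch = mock.patch
```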

sroet

Looks great,

2 critical points:

I am not convinced "make_trajectory" does what it is supposed to,
I would like to have access to the final_state in the make_tis_trajectory

Also pointed out the issues my development environment was complaining about, feel free to ignore those

sroet · 2021-09-14T13:52:10Z

+@pytest.fixture
+def tis_network():
+    paths.InterfaceSet._reset()
+    cv = paths.FunctionCV("x", lambda s: s.xyz[0][0])


I am not sure about "import order", but would these fixtures work for functions that require monkey (un)patching?

The CVs used here are not compatible with SimStore, so this should not be used if the intent is to also test SimStore storage.

Of course, that will change in 2.0 when paths.FunctionCV is paths.experimental.storage.CollectiveVariable.

sroet · 2021-09-14T14:10:51Z

+@pytest.mark.parametrize('bounds', [(-0.1, 1.0), (-0.1, 0.5)])
+def test_make_trajectory(bounds):
+    lower, upper = bounds
+    traj = make_trajectory(lower, upper)
+    xvals = traj.xyz[:,0,0]
+    assert upper <= max(xvals) < upper + 0.1
+    assert lower <= min(xvals) < lower + 0.1
+    expected_len = round((upper - lower) / 0.1) + 1
+    assert len(traj) == expected_len


Just an FYI: be careful about using round. It uses (the correct) bankers' rounding, which might bite you if you do not expect it. (For the current values this test is fine, but for bounds=(-0.55, 0) it is not.)

>>> round(2.5)
2
>>> round(1.5)
2

Also, this test fails if upper is not a multiple of 0.1: bounds=(0, 0.45) fails, as does bounds=(0, 0.55).
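
As a small illustration of the rounding behavior described above, the sketch below compares the built-in `round` with a half-up variant. `round_half_up` is a hypothetical helper written for this example, and the tie values are chosen to be exactly representable as floats, so no floating-point noise is involved.

```python
import math


def round_half_up(value):
    """Round to the nearest integer, sending exact .5 ties upward."""
    return math.floor(value + 0.5)


ties = [0.5, 1.5, 2.5, 3.5]

# Built-in round sends exact ties to the nearest even integer.
builtin = [round(v) for v in ties]          # [0, 2, 2, 4]

# The half-up variant sends every exact tie upward.
half_up = [round_half_up(v) for v in ties]  # [1, 2, 3, 4]
```

The half-up helper is still exposed to floating-point error for inputs that are not exactly representable, so computing expected lengths from integer inputs avoids the problem more reliably than swapping the rounding rule.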

sroet · 2021-09-14T14:52:15Z

+    xvals = np.arange(lower, upper + 0.01, 0.1) + np.random.random() * 0.1
+    return make_1d_traj(xvals)


There is no guarantee that upper is crossed here.

Say lower=0 and upper=0.09: then xvals becomes [0 + epsilon], with epsilon drawn uniformly from [0, 0.1), which only reaches 0.09 10% of the time.
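
The case above can be checked without NumPy. In this sketch, `arange_like` is a standard-library stand-in for `np.arange` (same documented length rule, `ceil((stop - start) / step)`), and `max_x` is a hypothetical helper that evaluates the reviewed expression for a fixed offset instead of a random one.

```python
import math


def arange_like(start, stop, step):
    """Stand-in for np.arange: start + i*step for i < ceil((stop-start)/step)."""
    count = max(0, math.ceil((stop - start) / step))
    return [start + i * step for i in range(count)]


def max_x(lower, upper, offset):
    """Largest x value of arange(lower, upper + 0.01, 0.1) + offset."""
    return max(x + offset for x in arange_like(lower, upper + 0.01, 0.1))


lower, upper = 0.0, 0.09

# The grid has a single point, so the offset alone decides the outcome.
grid = arange_like(lower, upper + 0.01, 0.1)

# An offset below 0.09 never reaches upper ...
misses = max_x(lower, upper, 0.05) < upper
# ... while an offset in [0.09, 0.1) does. That window is one tenth of
# the offset range [0, 0.1), hence the 10% figure.
reaches = max_x(lower, upper, 0.095) >= upper
```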

sroet · 2021-09-14T15:13:12Z

+
+def make_tis_trajectory(cv_max, lower_bound=-0.1):
+    """Make a TIS trajectory with a given maximum x value.
+
+    """
+    increasing = make_trajectory(lower_bound, cv_max)
+    if cv_max >= 1.0:
+        return increasing
+    else:
+        decreasing = increasing.reversed[1:]
+        return increasing + decreasing


The cutoff for switching from an increasing-only trajectory to an increasing-then-decreasing one should probably be user configurable. Also, 2 blank lines are expected and only 1 was found.

Suggested change

-def make_tis_trajectory(cv_max, lower_bound=-0.1):
-    """Make a TIS trajectory with a given maximum x value.
-
-    """
-    increasing = make_trajectory(lower_bound, cv_max)
-    if cv_max >= 1.0:
-        return increasing
-    else:
-        decreasing = increasing.reversed[1:]
-        return increasing + decreasing
+def make_tis_trajectory(cv_max, lower_bound=-0.1, final_state=1.0):
+    """Make a TIS trajectory with a given maximum x value.
+
+    """
+    increasing = make_trajectory(lower_bound, cv_max)
+    if cv_max >= final_state:
+        return increasing
+    else:
+        decreasing = increasing.reversed[1:]
+        return increasing + decreasing

dwhswenson · 2021-11-07T13:24:44Z

Sorry that this has been on pause for so long. I'll go ahead and take the PEP8 improvements. However, some of the other review comments made it clear to me that my code was not communicating its purpose well enough, because that purpose was not obvious to @sroet. I've been trying to figure out the best way to make the purpose clearer, but rather than try to predict what others will think, it's better to just ask!

Fundamental ideas that I don't think have been communicated clearly enough here are mainly about the make_trajectory/make_tis_trajectory methods:

1. This is a specific set of fixtures intended for analysis, and they are intended to be used together. Specifically, the make_trajectory method here is only intended to be used with the specific network fixture created here. Some of @sroet's concerns deal with the issue that someone might use make_tis_trajectory for a different network.
2. The trajectories created are designed to fit a style I've tended to use, which discretizes at intervals of 0.1. A number of @sroet's concerns deal with my implicit assumption that the inputs are at intervals of 0.1.

Here are a few options I've considered for making these things clearer to others reading the code. If you have other options, please suggest them!

Regarding point 1:

document it in docstrings
convert these separate but related fixtures to a single instance of some class, where, e.g., network and scheme might be properties, and make_trajectory is a method specific to the fixture

Regarding point 2:

docstrings
convert make_trajectory into something that takes integers in the range [-1, 10], which makes the discretization more explicit. This might also involve changing the network itself to actually use the state borders at 0.0, 10.0 instead of 0.0, 1.0.

These are the things I thought of; again, I'm very open to other options. The code in this PR is very much intended for re-use by contributors (please let me make it easier for you to write tests!), so it is extremely important that it is easily understood.

I'm somewhat leaning toward the more complicated solution in both cases (especially point 1, where it enables me to encapsulate the make_trajectory method with the fixture)

Co-authored-by: Sander Roet <sanderroet@hotmail.com>

sroet · 2021-11-08T11:17:27Z

About point 1:

I agree that if things are only to be used together, they should be grouped together in 1 object (instance or class)

About point 2:
If it is discrete I would force it to indeed only accept and work in integer space

This links the make_tis_trajectory method to a fixture class that depends on the details of the setup. In this way, there's less of an implicit dependence on a certain kind of setup. Also switches it so that the input values for trajectories here are based on integers instead of floats, which should avoid confusion. Probably still need significant docs here, but the tests seem to pass locally, so that's good.
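
A rough sketch of the integer-based design described in this commit message, using plain lists of integers in place of OPS trajectories. The class name `IntegerTrajectoryFixture`, the attribute `state_b_min`, and the default bounds are hypothetical and are not the names used in fixture_classes.py.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IntegerTrajectoryFixture:
    """Hypothetical fixture object tying trajectory makers to one setup."""

    state_b_min: int = 10  # assumed: the final state starts at x = 10

    def make_trajectory(self, lower, upper):
        """Return the x values lower, lower + 1, ..., upper."""
        for name, value in (("lower", lower), ("upper", upper)):
            if not isinstance(value, int):
                raise TypeError(f"{name} must be an integer, got {value!r}")
        if lower > upper:
            raise ValueError("lower must not exceed upper")
        return list(range(lower, upper + 1))

    def make_tis_trajectory(self, cv_max, lower=-1):
        """Rise to cv_max, then return unless the final state is reached."""
        increasing = self.make_trajectory(lower, cv_max)
        if cv_max >= self.state_b_min:
            return increasing
        # Drop the first reversed frame so the turning point is not doubled.
        decreasing = increasing[::-1][1:]
        return increasing + decreasing


fixture = IntegerTrajectoryFixture()
returning = fixture.make_tis_trajectory(3)
reactive = fixture.make_tis_trajectory(10)
```

Keeping the cutoff on the same object as the trajectory makers means a test cannot pair `make_tis_trajectory` with a setup that uses a different final state, and integer inputs make the discretization explicit.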

dwhswenson · 2021-12-09T20:04:14Z

I ended up doing some pretty significant changes since the last review, so this is probably worth a complete re-review, as opposed to just reviewing changes since last review. I re-organized according to the discussion in #1068 (comment) and following.

Some of the things that are currently in tests/analysis/utils might be moved to tests/utils in the future, because I think they might be useful in testing more broadly (using a consistent network for 2-state TPS and 2-state TIS setups instead of custom setups for every test class).

Here's what's now included in this PR:

tests/analysis/utils/fixture_classes.py: This includes the core fixture classes and tools to create fixtures within a conftest.py. These now group the make_trajectory/make_tis_trajectory in an object with the network and scheme, so it should be clear that these are all related.
tests/analysis/utils/test_fixture_classes.py: Tests for the stuff in fixture_classes. Note that this actually tests by way of the specific setups used in conftest.py, so technically is also testing that.
tests/analysis/utils/mock_movers.py: This is the core of mocking out pathmovers for analysis. Code here is largely unchanged from the previous, but moved to a separate and more clearly-named module.
tests/analysis/utils/test_mock_movers.py: Tests for the above. This is largely unchanged from the previous version, except that the range (-0.1, 1.0) was changed to (-1, 10) and trajectory inputs are now all integers.
tests/analysis/conftest.py: Fixtures that will be the most common setups.

sroet

Some extra imports, and some possible corrections on the docs. LGTM otherwise

Co-authored-by: Sander Roet <sanderroet@hotmail.com>

sroet

LGTM, feel free to merge on green

dwhswenson added 14 commits September 9, 2021 01:23

Start to analysis test utils

1c2915e

some tests for analysis/test_test_utils

1453fe6

Basic framework set up; tests for pathreversal

04963b8

tests for the repex mocks

a9ba1b5

add mock for RandomChoiceMover

eac4f37

tests for wrap_org_by_group

8046efd

add support for lower bounds in make_trajectory

f3ee596

forward shooting mock works/is tested

9267f2e

fixes for mock shooting movers

b3942bf

unify forward and backward shooting tests

37bc422

tests for forcing shooting acceptance

2397422

finish up primary test suite

9abf09a

clean up errorsi/error testing in analysis utils

8f8301a

clean up make_trajectory; non-singleton error

3cc880e

docstrings

31c7668

dwhswenson changed the title ~~[WIP] New analysis testing utilities~~ New analysis testing utilities Sep 12, 2021

dwhswenson marked this pull request as ready for review September 12, 2021 16:18

sroet requested changes Sep 14, 2021

Apply suggestions from code review

bfb190b

Co-authored-by: Sander Roet <sanderroet@hotmail.com>

dwhswenson added the enhancement label Nov 7, 2021

dwhswenson added this to the 1.6 milestone Nov 8, 2021

dwhswenson added 4 commits December 7, 2021 01:25

reorg, lots of docstrings for analysis test utils

168f4e5

minor cleanup; mainly style

fe77fd4

Merge remote-tracking branch 'upstream/master' into test-analysis-utils

4c2caf1

dwhswenson requested a review from sroet December 9, 2021 20:04

sroet approved these changes Dec 10, 2021

Comment thread openpathsampling/tests/analysis/conftest.py Outdated

Comment thread openpathsampling/tests/analysis/utils/mock_movers.py Outdated

Comment thread openpathsampling/tests/analysis/utils/mock_movers.py Outdated

Apply suggestions from code review

bf098d3

Co-authored-by: Sander Roet <sanderroet@hotmail.com>

sroet approved these changes Dec 10, 2021

dwhswenson merged commit ffe5fa0 into openpathsampling:master Dec 10, 2021

dwhswenson deleted the test-analysis-utils branch December 10, 2021 14:19

sroet mentioned this pull request Dec 10, 2021

Actually check for CVDefinedVolume.lambda_max == "inf" #1092

Merged

dwhswenson mentioned this pull request Jan 4, 2024

Release 1.6.0 #1138

Merged
