
refactor: Make hypotest return CLs as 0-d tensor #944

Merged: 19 commits merged into master on Aug 14, 2020
Conversation

matthewfeickert (Member) commented Jul 14, 2020

Description

Resolves #714

This results in the following behavior for the CLs values:

>>> import pyhf
>>> model = pyhf.simplemodels.hepdata_like(signal_data=[12.0, 11.0], bkg_data=[50.0, 52.0], bkg_uncerts=[3.0, 7.0])
>>> data = [51, 48] + model.config.auxdata
>>> test_mu = 1.0
>>> CLs_obs, CLs_exp = pyhf.infer.hypotest(test_mu, data, model, qtilde=True, return_expected=True)
>>> CLs_obs
array(0.05251554)
>>> CLs_exp
array(0.06445521)

The previous behavior was:

>>> import pyhf
>>> model = pyhf.simplemodels.hepdata_like(signal_data=[12.0, 11.0], bkg_data=[50.0, 52.0], bkg_uncerts=[3.0, 7.0])
>>> data = [51, 48] + model.config.auxdata
>>> test_mu = 1.0
>>> CLs_obs, CLs_exp = pyhf.infer.hypotest(test_mu, data, model, qtilde=True, return_expected=True)
>>> CLs_obs
array([0.05251554])
>>> CLs_exp
array([0.06445521])
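The practical difference between the two return shapes can be seen with plain NumPy (an illustration using hard-coded stand-in values, not pyhf code):

```python
import numpy as np

# Old return shape: a length-1 array; getting the scalar required indexing.
old_style = np.array([0.05251554])
print(old_style.shape)  # (1,)
print(old_style[0])     # 0.05251554

# New return shape: a 0-d array, which converts directly to a Python float.
new_style = np.array(0.05251554)
print(new_style.shape)  # ()
print(float(new_style)) # 0.05251554
```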

Checklist Before Requesting Reviewer

  • Tests are passing
  • "WIP" removed from the title of the pull request
  • Selected an Assignee for the PR to be responsible for the log summary

Before Merging

For the PR Assignees:

  • Summarize commit messages into a comprehensive review of the PR
* Have hypotest return the CLs as a 0-d tensor
   - Important for fully differential likelihoods
* Update the docs to reflect changes
* Update tests to use return type of 0-d tensor

@matthewfeickert matthewfeickert added docs Documentation related API Changes the public API labels Jul 14, 2020
@matthewfeickert matthewfeickert self-assigned this Jul 14, 2020
matthewfeickert (Member Author)

Still need to correct PyTorch's behavior of returning a Tensor.

@matthewfeickert matthewfeickert changed the title api: Change CLs to be scalar refactor: Change CLs to be scalar Jul 14, 2020
lukasheinrich (Contributor)

This will likely require a major version bump.

kratsg (Contributor) commented Jul 14, 2020

> this will likely require a major version bump

You want this to be v1.0.0? Or v0.5.0?

lukasheinrich (Contributor)

Meh, as long as we're still 0.X we can do minor bumps, I guess.

lukasheinrich (Contributor) commented Jul 17, 2020

As part of this, we probably need to fix this as well:

import pyhf
import numpy as np
import jax
import torch
import tensorflow as tf

print("numpy")
print(np.asarray(0.1))
pyhf.set_backend("numpy")
print(pyhf.tensorlib.astensor(0.1))

print("\njax")
print(jax.numpy.asarray(0.1))
pyhf.set_backend("jax")
print(pyhf.tensorlib.astensor(0.1))

print("\ntorch")
print(torch.tensor(0.1))
pyhf.set_backend("pytorch")
print(pyhf.tensorlib.astensor(0.1))

print("\ntensorflow")
print(tf.constant(0.1))
pyhf.set_backend("tensorflow")
print(pyhf.tensorlib.astensor(0.1))
Output:

numpy
0.1
[0.1]

jax
0.1
[0.1]

torch
tensor(0.1000)
tensor([0.1000])

tensorflow
tf.Tensor(0.1, shape=(), dtype=float32)
tf.Tensor([0.1], shape=(1,), dtype=float32)

matthewfeickert (Member Author) commented Jul 17, 2020

> As part of this we probably need to fix this as well

Ah, I guess so. I think we did this in the past to intentionally enforce uniformity across all backends, but they all behave the same now, so forcing the shape isn't helping.

import pyhf
import numpy as np
import torch

print("numpy")
example = np.asarray(0.1)
print(f"example {example} is a {type(example)} with shape {example.shape}")
pyhf.set_backend("numpy")
example = pyhf.tensorlib.astensor(0.1)
print(f"example {example} is a {type(example)} with shape {example.shape}")

print("\ntorch")
example = torch.tensor(0.1)
print(f"example {example} is a {type(example)} with shape {example.shape}")
pyhf.set_backend("pytorch")
example = pyhf.tensorlib.astensor(0.1)
print(f"example {example} is a {type(example)} with shape {example.shape}")
Output:

numpy
example 0.1 is a <class 'numpy.ndarray'> with shape ()
example [0.1] is a <class 'numpy.ndarray'> with shape (1,)

torch
example 0.10000000149011612 is a <class 'torch.Tensor'> with shape torch.Size([])
example tensor([0.1000]) is a <class 'torch.Tensor'> with shape torch.Size([1])

So we should probably just fix this first in a separate PR.
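A minimal sketch of the kind of fix being discussed, assuming the backends' `astensor` wraps the library's array constructor (hypothetical helper names and a NumPy-only sketch, not pyhf's actual implementation):

```python
import numpy as np

def astensor_old(data):
    # Old behavior: force at least one dimension, so scalars become shape (1,).
    return np.atleast_1d(np.asarray(data, dtype=np.float64))

def astensor_new(data):
    # Proposed behavior: preserve the input's dimensionality, so a scalar
    # stays a 0-d array, matching np.asarray / torch.tensor / tf.constant.
    return np.asarray(data, dtype=np.float64)

print(astensor_old(0.1).shape)    # (1,)
print(astensor_new(0.1).shape)    # ()
print(astensor_new([0.1]).shape)  # (1,) -- list input is unchanged
```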

matthewfeickert (Member Author)

@lukasheinrich @kratsg Related to Issue #974 and my last question in Issue #714: do we want this to return a 0-d tensor, or do we want a float?

@matthewfeickert matthewfeickert changed the title refactor: Change CLs to be scalar refactor: Make hypotest return CLs as 0-d tensor Aug 3, 2020
codecov bot commented Aug 3, 2020

Codecov Report

Merging #944 into master will decrease coverage by 0.00%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master     #944      +/-   ##
==========================================
- Coverage   96.70%   96.70%   -0.01%     
==========================================
  Files          59       59              
  Lines        3338     3337       -1     
  Branches      467      468       +1     
==========================================
- Hits         3228     3227       -1     
  Misses         69       69              
  Partials       41       41              
Flag Coverage Δ
#unittests 96.70% <100.00%> (-0.01%) ⬇️


Impacted Files Coverage Δ
src/pyhf/cli/infer.py 100.00% <ø> (ø)
src/pyhf/infer/__init__.py 100.00% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 6e5b7bb...5cd8324.

matthewfeickert (Member Author) commented Aug 3, 2020

The current state of the PR results in the following:

import pyhf
model = pyhf.simplemodels.hepdata_like(
    signal_data=[12.0, 11.0], bkg_data=[50.0, 52.0], bkg_uncerts=[3.0, 7.0]
)
data = [51, 48] + model.config.auxdata
test_mu = 1.0
for backend in ["numpy", "jax", "tensorflow", "pytorch"]:
    pyhf.set_backend(backend)
    CLs_obs, CLs_exp = pyhf.infer.hypotest(
        test_mu, data, model, qtilde=True, return_expected=True
    )
    print(
        f"Observed: {CLs_obs} is of type {type(CLs_obs)}, Expected: {CLs_exp} is of type {type(CLs_exp)}"
    )

giving

Observed: 0.052515541856109765 is of type <class 'numpy.ndarray'>, Expected: 0.06445521290832805 is of type <class 'numpy.ndarray'>
Observed: 0.05251554103025729 is of type <class 'jax.interpreters.xla.DeviceArray'>, Expected: 0.06445520998614704 is of type <class 'jax.interpreters.xla.DeviceArray'>
Observed: 0.05251520499587059 is of type <class 'tensorflow.python.framework.ops.EagerTensor'>, Expected: 0.06445307284593582 is of type <class 'tensorflow.python.framework.ops.EagerTensor'>
Observed: 0.052520569413900375 is of type <class 'torch.Tensor'>, Expected: 0.06446529179811478 is of type <class 'torch.Tensor'>

This change in return type should probably result in a release, even though the public API tests still pass.
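A short NumPy illustration (stand-in values, not pyhf code) of why this counts as a breaking change for callers that indexed the old shape-(1,) result:

```python
import numpy as np

cls_obs = np.array(0.05251554)  # stand-in for the new 0-d hypotest result

# Code written against the old shape (1,) would have done cls_obs[0],
# which raises an IndexError on a 0-d array:
try:
    cls_obs[0]
except IndexError as err:
    print(f"IndexError: {err}")

# Backend-agnostic alternatives that work for the 0-d result:
print(float(cls_obs))  # 0.05251554
print(cls_obs.item())  # 0.05251554
```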

matthewfeickert (Member Author)

@phinate @WolfgangWaltenberger can you give feedback on whether this will be problematic?

phinate (Contributor) commented Aug 3, 2020

@matthewfeickert it’s gonna break neos temporarily, but it’s the smallest fix, so I say just go for it. I’m in the middle of a refactor, so I can add this as part of it.
(Well, I guess we’re actually using pyhf@diffable_json, so it won’t break until that gets rebased)

WolfgangWaltenberger

> @phinate @WolfgangWaltenberger can you give feedback on if this will be problematic?

If by "this" you mean the change of return types, then no, that's not a problem.

matthewfeickert (Member Author)

Thanks for the prompt feedback @phinate and @WolfgangWaltenberger. We just wanted to confirm that having the CLs values be 0-d tensors (shape ()) instead of shape (1,) tensors wouldn't be an issue, and that a 0-d tensor is still okay as compared to going all the way to a float.

matthewfeickert (Member Author) commented Aug 6, 2020

@alexander-held Will this affect cabinetry?

alexander-held (Member)

Hypothesis tests are not yet interfaced, so no impact on cabinetry.

@kratsg kratsg merged commit 6e6f7a1 into master Aug 14, 2020
@kratsg kratsg deleted the api/make-CLs-scalar branch August 14, 2020 11:25
Labels: API (Changes the public API), docs (Documentation related), tests (pytest)
Successfully merging this pull request may close: "API: change CLs to be scalar"
6 participants