
feat: fixed_vals #846

Closed
wants to merge 34 commits into from

Conversation

cgottard
Contributor

@cgottard cgottard commented Apr 27, 2020

fixed_vals argument in infer functions

I propagated to the infer functions the capability of fixing selected parameters to constant values. The optimizer already implemented this functionality, so it was only a matter of interfacing with it.
I tested the changes by running both CLs calculations and MLE fits. The CI succeeds.

Note: before these changes it was already possible to perform an MLE fit with a list of fixed parameters passed via **kwargs. The same was not true for hypotest. I found that adding an explicit function argument in both cases was appropriate for such a common task.
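The mechanics of holding parameters fixed can be sketched in plain Python. This is an illustrative sketch only, not the pyhf implementation: `apply_fixed_vals` is a hypothetical helper, and `fixed_vals` follows the list-of-(index, value)-tuples format proposed in this PR.

```python
def apply_fixed_vals(init_pars, fixed_vals):
    """Pin selected parameters to constants and report which indices
    the optimizer is still free to vary.

    Illustrative sketch only -- not the actual pyhf code.
    fixed_vals: list of (parameter_index, constant_value) tuples.
    """
    pars = list(init_pars)
    fixed_idx = set()
    for idx, value in fixed_vals:
        pars[idx] = value
        fixed_idx.add(idx)
    free_idx = [i for i in range(len(pars)) if i not in fixed_idx]
    return pars, free_idx


# Fix the first of three parameters to 0.5; the other two stay free.
pars, free_idx = apply_fixed_vals([1.0, 1.0, 1.0], [(0, 0.5)])
# pars == [0.5, 1.0, 1.0]; free_idx == [1, 2]
```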

@kratsg, we discussed the use case for these changes via e-mail. Could you review this MR?

Checklist Before Requesting Reviewer

  • Tests are passing
  • "WIP" removed from the title of the pull request
  • Selected an Assignee for the PR to be responsible for the log summary

Before Merging

For the PR Assignees:

  • Summarize commit messages into a comprehensive review of the PR

@cgottard cgottard changed the title Fixed vals feat:fixed_vals Apr 27, 2020
@cgottard cgottard changed the title feat:fixed_vals feat: fixed_vals Apr 27, 2020
@matthewfeickert matthewfeickert added the feat/enhancement New feature or request label Apr 27, 2020
@matthewfeickert matthewfeickert added the API Changes the public API label Apr 27, 2020
@lukasheinrich
Contributor

Hi @cgottard, thanks a lot for this PR. It's something we have wanted to add for some time. I wonder whether we could streamline the API further. As it is, we do a lot of passing around of these items:

  • init values
  • bounds values
  • fixed_values

and perhaps we should instead be moving around pdfconfig objects:

sb_config = model.make_config()
sb_config.set_poival(1.0)

b_config = model.make_config()
b_config.set_poival(0.0)

pyhf.infer.hypotest(sb_config, data, pdf)

what do you think?

@@ -17,6 +24,7 @@ def hypotest(
pdf (~pyhf.pdf.Model): The HistFactory statistical model
init_pars (Array or Tensor): The initial parameter values to be used for minimization
par_bounds (Array or Tensor): The parameter value bounds to be used for minimization
fixed_vals (list of tuples): Parameters to be held constant and their value
Contributor

need to be careful, since this will be confusing given the existing poi_test argument.

Contributor Author

It can be confusing because fixed_poi_fit is not strictly necessary anymore. Anyway, in fixed_poi_fit the mu is fixed via fixed_vals, and if additional parameters are set constant they are appended to the list; see https://github.com/cgottard/pyhf/blob/fixed_vals/src/pyhf/infer/mle.py#L56
So there should be no problem nor ambiguity.
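The merge described above can be sketched in a few lines of plain Python. `combine_fixed_vals` is a hypothetical helper for illustration, not the actual pyhf code: the POI fix comes first, and any additional user-supplied (index, value) pairs are appended, so the two never conflict.

```python
def combine_fixed_vals(poi_index, poi_value, extra_fixed_vals=None):
    """Sketch of how a fixed-POI fit can merge the POI constraint with
    extra user-fixed parameters. Hypothetical helper, not pyhf itself."""
    fixed_vals = [(poi_index, poi_value)]  # mu is always fixed first
    if extra_fixed_vals:
        fixed_vals.extend(extra_fixed_vals)  # then any extra constants
    return fixed_vals


# POI at index 0 fixed to 1.0, plus one nuisance parameter fixed to 0.3
combined = combine_fixed_vals(0, 1.0, [(2, 0.3)])
# combined == [(0, 1.0), (2, 0.3)]
```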

@kratsg
Contributor

kratsg commented Apr 27, 2020

I think we should have this in for 0.6.0 because I don't think trying to keep the same API is a good idea.

@lukasheinrich
Contributor

@cgottard would you be up for shepherding a larger change to the API that enables this feature?

@cgottard
Contributor Author

Dear all,

thanks for the feedback. I was addressing @matthewfeickert's comment about the test, but locally pytest is not running as it should, so I only see the errors from the CI.

Anyway, I am ok with changing the API as @lukasheinrich suggested. I agree that it makes everything clearer and more transparent. In the meantime I think we can keep this MR open to discuss the progress, then we'll see if we want to close it and create a new one using a new feature branch name.

@kratsg
Contributor

kratsg commented Apr 27, 2020

Anyway, I am ok with changing the API as @lukasheinrich suggested. I agree that it makes everything clearer and more transparent. In the meantime I think we can keep this MR open to discuss the progress, then we'll see if we want to close it and create a new one using a new feature branch name.

I think this PR should go in close to how it is, but with tests and coverage, as part of 0.5.0 -> 0.6.0 (we don't want very large PRs), and you've pointed out this is a relatively quick change. The API remains backwards compatible here. Then we change the API in another PR. Very large PRs in general scare me.

@lukasheinrich
Contributor

lukasheinrich commented Apr 27, 2020 via email

@cgottard
Contributor Author

Ok, I'll ping you when this is ready. I implemented a new background uncertainty in test_backend_consistency, which is then fixed and shifted.
The syntax is correct, but I see AssertionErrors that I need to investigate. I'll fix my local pytest first, so as to avoid all these useless commits.

@cgottard
Contributor Author

cgottard commented May 8, 2020

Dear all,

I cloned the repo from master and ran test_backend_consistency.py locally. As it is, everything succeeds.
Then I changed the statistical model to match what I have in this MR.

Model
source = generate_source_static(n_bins)
bkg_unc_5pc_up = [x + 0.05 * x for x in source['bindata']['bkg']]
bkg_unc_5pc_dn = [x - 0.05 * x for x in source['bindata']['bkg']]
signal_sample = {
    'name': 'signal',
    'data': source['bindata']['sig'],
    'modifiers': [{'name': 'mu', 'type': 'normfactor', 'data': None}],
}

background_sample = {
    'name': 'background',
    'data': source['bindata']['bkg'],
    'modifiers': [
        {
            'name': 'uncorr_bkguncrt',
            'type': 'shapesys',
            'data': source['bindata']['bkgerr'],
        },
        {
            'name': 'norm_bkgunc',
            'type': 'histosys',
            'data': {'hi_data': bkg_unc_5pc_up, 'lo_data': bkg_unc_5pc_dn},
        },
    ],
}
and pytorch returns a result slightly above tolerance.

FAILED test_backend_consistency.py::test_hypotest_q_mu[normal-500_bins] - assert False
FAILED test_backend_consistency.py::test_hypotest_q_mu[inverted-500_bins] - assert False
because
(array([0.00000000e+00, 1.59306155e-03, 1.06477109e-02, 2.60919799e-06]) < 0.01).all()
(array([0.00000000e+00, 1.59260676e-03, 1.06472520e-02, 2.14421824e-06]) < 0.01).all()

Then, I updated my feature branch to master and ran the test with the fixed values:

FAILED tests/test_backend_consistency.py::test_hypotest_q_mu[none-normal-500_bins] - assert False
FAILED tests/test_backend_consistency.py::test_hypotest_q_mu[none-inverted-500_bins] - assert False
because

(array([0.00000000e+00, 1.59306155e-03, 1.06477109e-02, 2.60919799e-06]) < 0.01).all()
(array([0.00000000e+00, 1.59260676e-03, 1.06472520e-02, 2.14421824e-06]) < 0.01).all()

Discrepancies among tensors are gone, possibly because I am using a conda env correctly configured for CUDA and TF. And yes, I am running from my branch, as I can verify from the stdout and from the "[none-" in the test names.

Given the small excess over the 0.01 tolerance, shall we increase the tolerance to 1.5%?

EDIT: actually the tensors are also slightly above the 5 per mille tolerance for those two cases:
0.008959253804614598 < 0.005

EDIT 2: different results are obtained for the same backend if the fit is run on CPU or GPU

GPU - TEST PASSED
Fit with fixed pars [] BINS 500 QMU 3.936610076294528 ORDER True BACKEND <pyhf.tensor.jax_backend.jax_backend object at 0x7fbfda648dd0>
CHECK: Numpy difference [0.00000000e+00 1.59260676e-03 1.06472520e-02 2.14421824e-06]

CI - TEST FAILED
same test from the CI:
(array([0.00000000e+00, 4.86420689e-02, 1.06472520e-02, 7.35593908e-09]) < 0.015).all()

Can we open a ticket for this? The backend consistency has little to do with this MR, which simply propagates a function argument.
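The tolerance check in question can be reproduced with plain Python (the real test compares tensors; the numbers below are the per-backend differences quoted above for the 500-bin pytorch case, and the 0.01/0.015 thresholds are the current and proposed tolerances):

```python
# Per-backend differences quoted above for the 500-bin case
diff = [0.00000000e+00, 1.59306155e-03, 1.06477109e-02, 2.60919799e-06]

# The consistency test asserts that every element is below tolerance.
passes_1pc = all(d < 0.01 for d in diff)     # third element ~1.06e-02 fails
passes_1p5pc = all(d < 0.015 for d in diff)  # the proposed 1.5% would pass
```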

@lgtm-com

lgtm-com bot commented Jun 16, 2020

This pull request introduces 1 alert when merging f610990 into 94b87a8 - view on LGTM.com

new alerts:

  • 1 for Unused local variable

@lgtm-com

lgtm-com bot commented Jun 26, 2020

This pull request introduces 1 alert when merging 41d6b7f into cb4d37b - view on LGTM.com

new alerts:

  • 1 for Unused local variable

@codecov

codecov bot commented Jul 2, 2020

Codecov Report

Merging #846 into master will decrease coverage by 0.35%.
The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #846      +/-   ##
==========================================
- Coverage   96.64%   96.28%   -0.36%     
==========================================
  Files          59       56       -3     
  Lines        3279     3180      -99     
  Branches      454      438      -16     
==========================================
- Hits         3169     3062     -107     
- Misses         69       75       +6     
- Partials       41       43       +2     
Flag Coverage Δ
#unittests 96.28% <0.00%> (-0.36%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
src/pyhf/modifiers/normfactor.py 89.18% <0.00%> (-10.82%) ⬇️
src/pyhf/infer/calculators.py 97.95% <0.00%> (-2.05%) ⬇️
src/pyhf/tensor/jax_backend.py 94.16% <0.00%> (-1.56%) ⬇️
src/pyhf/__init__.py 97.87% <0.00%> (-0.72%) ⬇️
src/pyhf/tensor/pytorch_backend.py 98.00% <0.00%> (-0.04%) ⬇️
src/pyhf/pdf.py 95.83% <0.00%> (-0.02%) ⬇️
src/pyhf/cli/cli.py 100.00% <0.00%> (ø)
src/pyhf/cli/infer.py 100.00% <0.00%> (ø)
src/pyhf/infer/mle.py 100.00% <0.00%> (ø)
src/pyhf/cli/__init__.py 100.00% <0.00%> (ø)
... and 18 more

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 27f35e9...ccc9596. Read the comment docs.

@matthewfeickert
Member

matthewfeickert commented Aug 18, 2020

@cgottard Sorry to have left you hanging here on this. If you have time can you rebase this so that we can try to get this in for v0.5.2? If you don't one of the core devs can do it.

Edit: @kratsg mentioned that he already talked with you, so he'll take care of this PR and we'll shepherd it in. Thank you in advance for your contribution!

@kratsg
Contributor

kratsg commented Sep 4, 2020

Closing in favor of #1051. Thanks a lot @cgottard for the initial push to get this working. I cleaned it up a lot in the new PR and rebased it all.

Will be migrating the tests shortly.

@kratsg kratsg closed this Sep 4, 2020
@kratsg kratsg added the wontfix This will not be worked on label Sep 4, 2020
Labels
  • API (Changes the public API)
  • feat/enhancement (New feature or request)
  • wontfix (This will not be worked on)