@ellis.eu PhD - visiting @avehtari.bsky.social 🇫🇮
🤔💭 Monte Carlo, probabilistic ML.
Interested in many things relating to probML, keen to learn applications in climate/science.
https://www.branchini.fun/about
"Value-aware Importance Weighting for Off-policy Reinforcement Learning"
proceedings.mlr.press/v232/de-asis...
I am free to clean my room by throwing everything inside the closet.
Regarding the paper: I was particularly interested in what they mention as future work, as far as I understand it.
So looking only at k-hat values doesn't fully explain the difference in performance between two competing estimators.
The reliability of the two estimators, i.e., of the numerator and the denominator, can be *individually* assessed with the k-hat diagnostic of Vehtari et al. (2024, JMLR).
Intuitively, the reliability of an IS estimator depends on the behavior of the density ratios (weights).
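A minimal sketch of that intuition, under loud assumptions: the target p = N(0, 1), the proposal q = N(0, 2^2), the test function f(x) = x^2, and the tail size are all made up for illustration. It also swaps in a plain Hill estimator as a rough tail-heaviness measure, so it is not the generalized Pareto fit behind the actual k-hat diagnostic. It only shows the shape of the idea: a self-normalized IS estimate is a ratio of two sums, and the tails of the numerator terms and of the denominator terms can be inspected separately.

```python
import math
import random


def log_normal_pdf(x, mean, std):
    """Log density of a univariate normal distribution."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))


def hill_tail_index(values, n_tail):
    """Hill estimate of the tail shape from the n_tail largest positive values.

    A simple stand-in here, NOT the generalized Pareto fit used by k-hat.
    """
    positive = sorted(v for v in values if v > 0.0)
    if len(positive) <= n_tail:
        raise ValueError("need more than n_tail positive values")
    threshold = positive[-(n_tail + 1)]
    return sum(math.log(v / threshold) for v in positive[-n_tail:]) / n_tail


def demo(n=20000, seed=0):
    """Hypothetical toy setup: p = N(0, 1), q = N(0, 2^2), f(x) = x^2."""
    rng = random.Random(seed)
    f = lambda x: x * x  # E_p[f] = 1 under the assumed target

    # Draw from the proposal and compute log weights log p(x) - log q(x).
    xs = [rng.gauss(0.0, 2.0) for _ in range(n)]
    log_w = [log_normal_pdf(x, 0.0, 1.0) - log_normal_pdf(x, 0.0, 2.0) for x in xs]

    # Subtract the max before exponentiating; the constant cancels in the ratio.
    shift = max(log_w)
    w = [math.exp(lw - shift) for lw in log_w]

    num_terms = [wi * f(x) for wi, x in zip(w, xs)]  # numerator summands
    den_terms = w                                    # denominator summands
    estimate = sum(num_terms) / sum(den_terms)

    n_tail = int(3 * math.sqrt(n))  # illustrative tail size, an assumption
    k_num = hill_tail_index([abs(t) for t in num_terms], n_tail)
    k_den = hill_tail_index(den_terms, n_tail)
    return estimate, k_num, k_den


estimate, k_num, k_den = demo()
print(f"SNIS estimate: {estimate:.3f}")
print(f"tail index, numerator terms: {k_num:.3f}")
print(f"tail index, denominator terms: {k_den:.3f}")
```

The two tail indices come from different sets of terms (w·f versus w alone), so one can look fine while the other does not; for a real analysis, a proper PSIS implementation would replace the Hill stand-in.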
Come to our poster at the NeurIPS BDU workshop on Saturday - see TL;DR below.
Explicit effect of dimension on the error!
@nolovedeeplearning.bsky.social
- the notes on Monte Carlo here: sites.google.com/site/fportie...
- arxiv.org/abs/2102.05407
All have different (complementary) material. The basic concept is super general (maybe too general, I suspect @nolovedeeplearning.bsky.social may argue :D).