Part of the book series:Lecture Notes in Computer Science ((LNAI,volume 14942))
Included in the following conference series:
835Accesses
Abstract
Estimating the individual treatment effect (ITE) from observational data is an important issue both theoretically and practically. While including all the pre-treatment covariates is the common practice for the inclusion of all possible confounders, it may aggravate the issue of data imbalance. In this paper, we theoretically show that including extra information would increase the variance lower bound. Based on the causal graph, we decompose the covariates into three components, namely instrumental variables (I), confounders (C), and adjustment variables (A). BothC andA should be included for the ITE estimation, whileI should be avoided since it would aggravate the imbalance issue and contains no extra information for the ITE estimation. To facilitate the decomposed representation learning, we derive the probabilistic conditions for\(\{I, C, A\}\) from the graphical definitions, and theoretically show that such decomposition can be learned in an adversarial manner. Under the guidance of such theoretical justification, we propose the ADR algorithm, an adversarial learning approach to learn the decomposed representations and simultaneously estimate the treatment effect. The proposed algorithm can be applied to both categorical and numerical treatments and the effectiveness is assured by both theoretical analyses and empirical results. Experimental results on both synthetic and real data show that the ADR Algorithm is advantageous compared to the state-of-the-art methods. The theoretical analyses also provide a path to further explore the issue of decomposed representation learning for causal inference.
This is a preview of subscription content,log in via an institution to check access.
Access this chapter
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
Buy Now
- Chapter
- JPY 3498
- Price includes VAT (Japan)
- eBook
- JPY 17159
- Price includes VAT (Japan)
- Softcover Book
- JPY 10581
- Price includes VAT (Japan)
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Jaskowski, M., Jaroszewicz, S.: Uplift modeling for clinical trial data. In: ICML Workshop on Clinical Data Analysis, vol. 46, pp. 79–95 (2012)
Alaa, A.M., Van Der Schaar, M.: Bayesian inference of individualized treatment effects using multi-task gaussian processes. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Alaa, A.M., Weisz, M., Van Der Schaar, M.: Deep counterfactual networks with propensity-dropout. arXiv preprintarXiv:1706.05966 (2017)
Sato, M., Singh, J., Takemori, S., Sonoda, T., Zhang, Q., Ohkuma, T.: Uplift-based evaluation and optimization of recommenders. In: Proceedings of the 13th ACM Conference on Recommender Systems, pp. 296–304 (2019)
Wan, S., et al.: GCF: generalized causal forest for heterogeneous treatment effect estimation in online marketplace. arXiv preprintarXiv:2203.10975 (2022)
Guelman, L., Guillén, M., Pérez-Marín, A.M.: A decision support framework to implement optimal personalized marketing interventions. Dec. Support Syst.72, 24–32 (2015)
Rubin, D.B.: Estimating causal effects of treatments in randomized and nonrandomized studies. J. Educ. Psychol.66(5), 688 (1974)
Splawa-Neyman, J., Dabrowska, D.M., Speed, T.P.: On the application of probability theory to agricultural experiments. essay on principles. Section 9. Stat. Sci. 465–472 (1990)
Pearl, J.: Causality. Cambridge University Press, Cambridge (2009)
Pearl, J.: Causal inference in statistics: an overview. Stat. Surv.3, 96–146 (2009)
Künzel, S.R., Sekhon, J.S., Bickel, P.J., Yu, B.: Metalearners for estimating heterogeneous treatment effects using machine learning. Proc. Natl. Acad. Sci.116(10), 4156–4165 (2019)
Imbens, G.W., Rubin, D.B.: Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press, Cambridge (2015)
Zhong, K., et al.: DESCN: deep entire space cross networks for individual treatment effect estimation. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 4612–4620 (2022)
Rubin, D.B.: For objective causal inference, design trumps analysis (2008)
Rubin, D.B.: Author’s reply (to judea pearl’s and arvid sjölander’s letters to the editor). Stat. Med.28, 1420–1423 (2009)
Shalit, U., Johansson, F.D., Sontag, D.: Estimating individual treatment effect: generalization bounds and algorithms. In: International Conference on Machine Learning, pp. 3076–3085. PMLR (2017)
Johansson, F., Shalit, U., Sontag, F.: Learning representations for counterfactual inference. In: International Conference on Machine Learning, pp. 3020–3029. PMLR (2016)
Johansson, F.D., Shalit, U., Kallus, N., Sontag, D.: Generalization bounds and representation learning for estimation of potential outcomes and causal effects. J. Mach. Learn. Res.23(1), 7489–7538 (2022)
Hassanpour, N., Greiner, R.: Learning disentangled representations for counterfactual regression. In: International Conference on Learning Representations (2020)
Zhang, W., Liu, L., Li, J.: Treatment effect estimation with disentangled latent factors. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 10923–10930 (2021)
Anpeng, W., Yuan, J., Kuang, K., Li, B., Runze, W., Zhu, Q., Zhuang, Y., Fei, W.: Learning decomposed representations for treatment effect estimation. IEEE Trans. Knowl. Data Eng.35(5), 4989–5001 (2022)
Chauhan, V.K., Molaei, S., Tania, M.H., Thakur, A., Zhu, T., Clifton, D.A.: Adversarial de-confounding in individualised treatment effects estimation. In: International Conference on Artificial Intelligence and Statistics, pp. 837–849. PMLR (2023)
Hassanpour , N., Greiner, N.: Counterfactual regression with importance sampling weights. In: Twenty-Eighth International Joint Conference on Artificial Intelligence IJCAI-19 (2019)
Rubin, D.B.: Causal inference using potential outcomes: design, modeling, decisions. J. Am. Stati. Assoc.100(469), 322–331 (2005)
Rao, C.R., et al.: Information and the accuracy attainable in the estimation of statistical parameters. Breakthroughs in Statistics, pp. 235–247 (1992)
Cramér, H.: Mathematical Methods of Statistics, vol. 26. Princeton University Press (1999)
Hahn, J.: On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, 315–331 (1998)
Goodfellow, I., et al.: Generative adversarial nets. In: Neural Information Processing Systems (2014)
Kuang, K., Cui, P., Li, B., Jiang, M., Yang, S., Wang, F.: Treatment effect estimation with data-driven variable decomposition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017)
Kuang, K., et al.: Data-driven variable decomposition for treatment effect estimation. IEEE Trans. Knowl. Data Eng.34(5), 2120–2134 (2020)
Zhang, W., Li, J., Liu, L.: A unified survey of treatment effect heterogeneity modelling and uplift modelling. ACM Comput. Surv. (CSUR)54(8), 1–36 (2021)
Olaya, D., Coussement, K., Verbeke, W.: A survey and benchmarking study of multitreatment uplift modeling. Data Min. Knowl. Disc.34, 273–308 (2020)
Yao, L., Li, S., Li, Y., Huai, M., Gao, J., Zhang, A.: Representation learning for treatment effect estimation from observational data. In: Advances in Neural Information Processing Systems, vol. 31 (2018)
Hill, J.L.: Bayesian nonparametric modeling for causal inference. J. Comput. Graphical Stat.20(1), 217–240 (2011)
Dorie, V.: NPCI: non-parametrics for causal inference (2016).https://github.com/vdorie/npci
Author information
Authors and Affiliations
JD Technology, Beijing, China
Xiangyu Zheng, Guogang Tian, Sen Wang & Zhixiang Huang
- Xiangyu Zheng
You can also search for this author inPubMed Google Scholar
- Guogang Tian
You can also search for this author inPubMed Google Scholar
- Sen Wang
You can also search for this author inPubMed Google Scholar
- Zhixiang Huang
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toXiangyu Zheng.
Editor information
Editors and Affiliations
LTCI, Télécom Paris, Palaiseau Cedex, France
Albert Bifet
KU Leuven, Leuven, Belgium
Jesse Davis
Faculty of Informatics, Vytautas Magnus University, Akademija, Lithuania
Tomas Krilavičius
Institute of Computer Science, University of Tartu, Tartu, Estonia
Meelis Kull
Department of Computer Science, Bundeswehr University Munich, Munich, Germany
Eirini Ntoutsi
Department of Computer Science, University of Helsinki, Helsinki, Finland
Indrė Žliobaitė
1Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zheng, X., Tian, G., Wang, S., Huang, Z. (2024). ADR: An Adversarial Approach to Learn Decomposed Representations for Causal Inference. In: Bifet, A., Davis, J., Krilavičius, T., Kull, M., Ntoutsi, E., Žliobaitė, I. (eds) Machine Learning and Knowledge Discovery in Databases. Research Track. ECML PKDD 2024. Lecture Notes in Computer Science(), vol 14942. Springer, Cham. https://doi.org/10.1007/978-3-031-70344-7_16
Download citation
Published:
Publisher Name:Springer, Cham
Print ISBN:978-3-031-70343-0
Online ISBN:978-3-031-70344-7
eBook Packages:Computer ScienceComputer Science (R0)
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative