Design and Validation of Structural Causal Model: A Focus on SENSE-EGRA Datasets

Gabriel Terna Ayem (1), Augustine Shey Nsang (2), Bernard Igoche Igoche (3), Garba Naankang (4)
(1) Computer Science Department, School of Information Technology and Computing, American University of Nigeria, Yola, Nigeria
(2) Computer Science Department, School of Information Technology and Computing, American University of Nigeria, Yola, Nigeria
(3) Computer Science Department, School of Computing, University of Portsmouth, United Kingdom
(4) True-life Engineering Department, School of Aerospace, Transport, and Manufacturing, Cranfield University, United Kingdom
Fulltext View | Download
How to cite (IJASEIT) :
Ayem, G. T., Nsang , A. S., Igoche, B. I., & Naankang, G. (2024). Design and Validation of Structural Causal Model: A Focus on SENSE-EGRA Datasets. International Journal of Advanced Science Computing and Engineering, 6(1), 45–51. https://doi.org/10.62527/ijasce.6.1.200

Designing and validating a causal model's correctness from a dataset whose background knowledge is obtained from a research process is not a common phenomenon. Studies have shown that in many critical areas, such as healthcare and education, researchers develop models from direct acyclic graphs without testing them. This phenomenon is worrisome and is bound to cast a dark shadow on the inference estimates that many arise from such models. In this study, we have designed a novel application-based SCM for the first time using the background knowledge gained from the American University of Nigeria (AUN), Yola, on the letter identification subtask of the Early Grade Reading Assessment (EGRA) program on the Strengthen Education in Northeast Nigeria (SENSE-EGRA) project dataset, which the USAID sponsored. We employed the conditional independence test (CIT) criteria for the model’s correctness validation testing, and the results show a near-perfect SCM.

J. F. Box, R. A. Fisher, the Life of a Scientist. New York, NY, USA: Wiley, 1978.

K. Benson and A. J. Hartz, "A comparison of observational studies and randomized, controlled trials," New Engl. J. Med., vol. 342, no. 25, pp. 1878–1886, Jun. 2000, doi: 10.1056/nejm200006223422506.

S. L. Silverman, "From randomized controlled trials to observational studies," Amer. J. Med., vol. 122, no. 2, pp. 114–120, Feb. 2009, doi:10.1016/j.amjmed.2008.09.030.

P. W. Tennant et al., "Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations," Int. J. Epidemiol., vol. 50, no. 2, pp. 620–632, Apr. 2021, doi: 10.1093/ije/dyaa213.

R. Guo et al., "A survey of learning causality with data: Problems and methods," ACM Comput. Surv., vol. 53, no. 4, pp. 1–37, Jul. 2020, doi:10.1145/3397269.

C. Hitchcock and M. Rédei, "Reichenbach's common cause principle," in The Stanford Encyclopedia of Philosophy, E. N. Zalta, Ed., Winter 2020.

J. Pearl, Causality: Models, Reasoning, and Inference, 2nd ed. Cambridge, U.K.: Cambridge Univ. Press, 2009.

J. Peters, D. Janzing, and B. Schölkopf, Elements of Causal Inference: Foundations and Learning Algorithms. Cambridge, MA, USA: MIT Press, 2017.

J. Neyman, "On the application of probability theory to agricultural experiments. Essay on principles. Section 9," Statist. Sci., vol. 5, no. 4, pp. 465–472, 1990.

D. B. Rubin, "Estimating causal effects of treatments in randomized and nonrandomized studies," J. Educ. Psychol., vol. 66, no. 5, pp. 688–701, Oct. 1974, doi: 10.1037/h0037350.

P. Spirtes, C. Glymour, and R. Scheines, "Discovery algorithms for causally sufficient structures," in Causation, Prediction, and Search. New York, NY, USA: Springer, 1993, pp. 103–162.

S. Greenland, J. Pearl, and J. M. Robins, "Causal diagrams for epidemiologic research," Epidemiology, vol. 10, no. 1, pp. 37–48, Jan. 1999, doi: 10.1097/00001648-199901000-00008.

S. L. Lauritzen, "Causal inference from graphical models," in Complex Stochastic Systems, O. E. Barndorff-Nielsen, D. R. Cox, and C. Klüppelberg, Eds. Boca Raton, FL, USA: Chapman and Hall/CRC, 2001, pp. 63–107.

F. Eberhardt, "Introduction to the foundations of causal discovery," Int. J. Data Sci. Anal., vol. 3, no. 2, pp. 81–91, Apr. 2017, doi: 10.1007/s41060-016-0038-6.

J. Y. Halpern, "Review of 'The Book of Why: The New Science of Cause and Effect', by Judea Pearl and Dana Mackenzie," J. Causal Inference, vol. 7, no. 1, 2019, Art. no. 20190013, doi: 10.1515/jci-2019-0013.

B. Neal, "Introduction to causal inference from a machine learning perspective," 2020, arXiv:2007.07041.

F. Elwert, "Graphical causal models," in Handbook of Causal Analysis for Social Research, S. L. Morgan, Ed. Dordrecht, Netherlands: Springer, 2013, pp. 245–273.

A. R. Nogueira et al., "Methods and tools for causal discovery and causal inference," WIREs Data Mining Knowl. Discov., vol. 12, no. 2, p. e1449, Mar. 2022, doi: 10.1002/widm.1449.

L. Yao et al., "A survey on causal inference," ACM Trans. Knowl. Discov. Data, vol. 15, no. 5, pp. 1–46, May 2021, doi:10.1145/3444944.

C. Glymour, K. Zhang, and P. Spirtes, "Review of causal discovery methods based on graphical models," Front. Genet., vol. 10, p. 524, Jun. 2019, doi: 10.3389/fgene.2019.00524.

J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA, USA: Morgan Kaufmann, 1988.

L. Gultchin et al., "Differentiable causal backdoor discovery," in Proc. 23rd Int. Conf. Artif. Intell. Statist. (AISTATS), Palermo, Italy, 2020, pp. 1–11.

J. Correa and E. Bareinboim, "Causal effect identification by adjustment under confounding and selection biases," in Proc. 31st AAAI Conf. Artif. Intell., San Francisco, CA, USA, 2017, pp. 3740–3746.

J. Tian and J. Pearl, "A general identification condition for causal effects," in Proc. 18th Nat. Conf. Artif. Intell. (AAAI), Edmonton, AB, Canada, 2002, pp. 567–573.

J. Pearl and D. Mackenzie, The Book of Why: The New Science of Cause and Effect. New York, NY, USA: Basic Books, 2018.

J. Pearl, "Theoretical impediments to machine learning with seven sparks from the causal revolution," in Proc. 11th ACM Int. Conf. Web Search Data Mining (WSDM), Marina Del Rey, CA, USA, 2018, pp. 3–3.

T. S. Richardson and J. M. Robins, "Single world intervention graphs (SWIGs): A unification of the counterfactual and graphical approaches to causality," Center Statist. Social Sci., Univ. Washington, Seattle, WA, USA, Tech. Rep. 128, 2013.

J. Zhang and P. Spirtes, "Intervention, determinism, and the causal minimality condition," Synthese, vol. 182, no. 3, pp. 335–347, Oct. 2011, doi: 10.1007/s11229-010-9751-1.

A. Hauser and P. Bühlmann, "Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs," J. Mach. Learn. Res., vol. 13, no. 1, pp. 2409–2464, Aug. 2012.

A. Hauser and P. Bühlmann, "Jointly interventional and observational data: estimation of interventional Markov equivalence classes of directed acyclic graphs," J. Roy. Stat. Soc. B, vol. 77, no. 1, pp. 291–318, Jan. 2015, doi: 10.1111/rssb.12071.

R. Mayrhofer and M. R. Waldmann, "Sufficiency and necessity assumptions in causal structure induction," Cogn. Sci., vol. 40, no. 8, pp. 2137–2150, Nov. 2016, doi: 10.1111/cogs.12318.

J. Zhang and W. Mayer, "Weakening faithfulness: some heuristic causal discovery algorithms," Int. J. Data Sci. Anal., vol. 3, no. 2, pp. 93–104, Apr. 2017, doi: 10.1007/s41060-016-0033-y.

J. Zhang and P. L. Spirtes, "Strong faithfulness and uniform consistency in causal inference," arXiv:1212.2506, Dec. 2012.

I. Shpitser, T. VanderWeele, and J. M. Robins, "On the validity of covariate adjustment for estimating causal effects," arXiv:1203.3515, Mar. 2012.

A. Ankan, I. M. Wortel, and J. Textor, "Testing graphical causal models using the R package 'dagitty'," Curr. Protoc., vol. 1, no. 2, p. e45, Feb. 2021, doi: 10.1002/cpz1.45.

F. Thoemmes, Y. Rosseel, and J. Textor, "Local fit evaluation of structural equation models using graphical criteria," Psychol. Methods, vol. 23, no. 1, pp. 27–41, Mar. 2018, doi:10.1037/met0000147.

J. Pearl and T. S. Verma, "A theory of inferred causation," in Studies in Logic and the Foundations of Mathematics, J. F. A. K. et al., Ed. Amsterdam, Netherlands: Elsevier, 1995, vol. 134, pp. 789–811.

J. Pearl and E. Bareinboim, "Transportability of causal and statistical relations: A formal approach," in Proc. 25th AAAI Conf. Artif. Intell., San Francisco, CA, USA, 2011, pp. 1–8.

J. Pearl, "Causal inference in statistics: An overview," Statist. Surv., vol. 3, pp. 96–146, 2009, doi: 10.1214/09-SS057.