Optimizing Healthcare Data Platforms Using Advanced ETL Algorithms for Cost-Efficiency and Scalability

Authors

  • Srinivas Bangalore Sujayendra Rao ZS Associates, USA Author
  • Naveen Kumar Siripuram CVS Health, USA Author
  • Deepak Venkatachalam CVS Health, USA Author

Keywords:

ETL optimization, healthcare data platforms, database migration, IBM MQ

Abstract

The implementation of advanced Extract, Transform, Load (ETL) algorithms Enhances cost efficiency and scalability which is useful for modern legacy healthcare data platforms. The aim of this research is to investigate the approaches to optimise ETL processes by utilising complex matching algorithms and strategic architecture assessments.

Downloads

Download data is not yet available.

References

J. L. Hernández, A. Abelló, and O. Romero, “Optimizing ETL processes in data warehouses: A survey,” ACM Computing Surveys, vol. 51, no. 3, pp. 1–41, Jun. 2018.

S. Mohanty, S. Jagadeesh, and H. Srivatsa, Big Data Imperatives: Enterprise Big Data Warehouse, BI Implementations and Analytics, Berkeley, CA, USA: Apress, 2013.

P. Vassiliadis, A. Simitsis, and S. Skiadopoulos, “Conceptual modeling for ETL processes,” in Proceedings of the 5th ACM International Workshop on Data Warehousing and OLAP, McLean, VA, USA, 2002, pp. 14–21.

A. Simitsis, P. Vassiliadis, and T. Sellis, “Optimizing ETL processes in data warehouses,” in Proceedings of the 21st International Conference on Data Engineering (ICDE), Tokyo, Japan, 2005, pp. 564–575.

A. Abelló, J. Samos, and F. Saltor, “YAM2: A multidimensional conceptual model extending UML,” Information Systems, vol. 31, no. 6, pp. 541–567, Sep. 2006.

G. Piatetsky-Shapiro, “The data-driven enterprise of 2025,” MIT Sloan Management Review, vol. 58, no. 4, pp. 3–5, 2017.

R. Kimball and M. Ross, The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd ed., Indianapolis, IN, USA: Wiley, 2013.

M. Golfarelli and S. Rizzi, Data Warehouse Design: Modern Principles and Methodologies, New York, NY, USA: McGraw-Hill, 2009.

M. Stonebraker and U. Çetintemel, “One size fits all: An idea whose time has come and gone,” in Proceedings of the 21st International Conference on Data Engineering (ICDE), Tokyo, Japan, 2005, pp. 2–11.

L. Golab, T. Johnson, and V. Shkapenyuk, “Scheduling updates in a real-time stream warehouse,” in Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB), Vienna, Austria, 2007, pp. 827–838.

A. Simitsis, P. Vassiliadis, and T. Sellis, “State-space optimization of ETL workflows,” IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 10, pp. 1404–1419, Oct. 2005.

D. H. Park, J. Lee, and C. S. Hong, “Big data ETL workflow optimization using reinforcement learning,” in Proceedings of the 17th Asia-Pacific Network Operations and Management Symposium (APNOMS), Busan, South Korea, 2015, pp. 329–334.

M. Rahmoun, A. Wrembel, and J. Darmont, “A survey of data loading methods for data warehouses,” ACM Computing Surveys, vol. 52, no. 4, pp. 1–36, Jul. 2019.

M. T. Özsu and P. Valduriez, Principles of Distributed Database Systems, 4th ed., New York, NY, USA: Springer, 2020.

A. Inamdar and S. Vij, “A review on data migration strategies in cloud computing,” Future Generation Computer Systems, vol. 121, pp. 112–123, Mar. 2021.

C. Yin et al., “Enhanced ETL pipeline for real-time analytics in healthcare data management,” in Proceedings of the IEEE International Conference on Big Data and Smart Computing (BigComp), Kyoto, Japan, 2022, pp. 232–238.

M. Azad, R. Islam, and M. P. Sarker, “Security and privacy challenges in healthcare ETL processes: A systematic review,” IEEE Access, vol. 9, pp. 56782–56804, May 2021.

A. McAfee and E. Brynjolfsson, “Big data: The management revolution,” Harvard Business Review, vol. 90, no. 10, pp. 60–68, Oct. 2012.

C. R. Min et al., “Federated learning in healthcare: Applications, challenges, and future directions,” IEEE Internet of Things Journal, vol. 8, no. 8, pp. 6225–6238, Apr. 2021.

A. S. Abu-El-Rub, A. M. Ghazal, and M. A. Ismail, “Cloud-based ETL frameworks for high-performance healthcare analytics,” in Proceedings of the IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Sydney, Australia, 2020, pp. 129–136.

Downloads

Published

08-06-2021

How to Cite

[1]
Srinivas Bangalore Sujayendra Rao, Naveen Kumar Siripuram, and Deepak Venkatachalam, “Optimizing Healthcare Data Platforms Using Advanced ETL Algorithms for Cost-Efficiency and Scalability ”, Los Angeles J Intell Syst Pattern Rec, vol. 1, pp. 93–133, Jun. 2021, Accessed: Mar. 07, 2026. [Online]. Available: https://lajispr.org/index.php/publication/article/view/41