From the course: Data Science and Analytics Career Paths and Certifications: First Steps

Enabling technologies

- [Instructor] A number of underlying technologies make data science a reality. These include data infrastructure, data management, and visualization technologies. Data infrastructure technologies support how data is shared, processed, and consumed. One of the most popular data infrastructure technologies data scientists use today is distributed computing in general and cloud computing in particular. Several key underlying technologies enable cloud computing. Virtualization is one of them. Distributed and redundant storage is another: Redundant Array of Independent Disks (RAID) and the Hadoop Distributed File System (HDFS) are prominent examples. Data management is handled by database management systems (DBMS). Data science requires highly scalable, reliable, and efficient ways to store, manage, and process data, which is why database management systems play a critical role in data science. As big data becomes…
