A working glossary for the Digital Age
Remember the tower of Babel? Tech talk can be a little like that. So to make sure we’re all on the same page, here’s a glossary of terms as defined by the Governor’s Blue-Ribbon Commission on the Economic Competitiveness of Computing and Data Analytics:
A step-by-step procedure for performing calculations; generally associated with data processing and automated reasoning.
Everyday tools for exploring data sets, such as queries and text search through discovery of meaningful patterns in data using advanced techniques such as machine learning, data visualization, and statistical analysis.
Data sets that are both massive and complex.
Computing resources that are delivered as a service via a network, typically the Internet.
DATA LIFE CYCLE:
Set of processes in an application that transforms raw data into actionable knowledge. It involves collection of raw data, preparation of information, analytics, visualization, and access.
Use of computational methods to find desired information in data sets.
The extraction of actionable knowledge directly from data through a process of discovery, or hypothesis formation and hypothesis testing. It is the fourth paradigm of science, following experiment, theory, and computational sciences. It refers to the conduct of data analysis as an empirical science, learning directly from data itself.
The use of automated algorithms to find and evaluate patterns in data, enabling predictions that are increasingly accurate. Often referred to as advanced analytics.
Familiar database technology in which data elements are characterized in a specific format.
Data that consists of a vast number of data points that often have multiple form and may or may not be inter-related.
Data analysis using visualization techniques, which enable researchers to look for novel patterns in data
The process of utilizing computer technology to complete a task. Computing may involve computer hardware and/or software, but must involve some form of a computer system.
Computational science is a rapidly growing multidisciplinary field that uses advanced computing capabilities to understand and solve complex problems. Computational science fuses three distinct elements: Algorithms (numerical and non-numerical) and modeling and simulation software developed to solve science (e.g., biological, physical, and social), engineering, and humanities problems; Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components needed to solve computationally demanding problems; and the Computing Infrastructure that supports both the science and engineering problem-solving and the developmental computer and information science.
Manipulate and analyze data for use in functional or business units. Identify and develop methodologically sound and re- producible approaches for analyzing data sets that are often large and/or messy.
Drawing from various information sources, analyze, visualize, and communicate insights regarding what has happened. Create models and software that predict what is going to happen or prescribe what should happen.
Frame industry problems as analytical problems and use statistical analysis to solve them. Create the data sets and analytical tools necessary to solve industry problems and/or innovate.
A number of approaches, largely based on advanced mathematics, that are used to collect, analyze, and extract information from data sets.