Definition of Data
Data is a collection of statistics or information that can be processed, stored, and analyzed. It can take various forms, including numbers, text, images, audio, or video. Data is used in decision-making, research, and various applications across any field.
Categorization of Data
1. Based on Structure:
- Structured
Data – Organized
and stored in a fixed format, such as databases and spreadsheets (e.g.,
customer records, sales data).
- Unstructured
Data – Lacks a
predefined format, such as text files, videos, images, and social media
posts.
- Semi-structured
Data – Has some
organization but does not fit into traditional databases (e.g., JSON, XML,
emails).
2. Based on Source:
- Primary Data – Collected directly from the
source for a specific purpose (e.g., surveys, experiments).
- Secondary
Data – Collected
by someone else and used for analysis (e.g., government reports, research
papers).
3. Based on Nature:
- Quantitative
Data – Expressed
in numbers and measurable (e.g., revenue, temperature, age).
- Qualitative
Data –
Descriptive and non-numeric (e.g., customer reviews, interviews).
4. Based on Distribution
- Discrete Data – Countable and finite values
(e.g., number of employees, number of products sold).
- Continuous Data – Can take any value within a
range (e.g., height, weight, temperature).
4. Based on Usage in Analytics:
- Nominal Data – Categorical data without a
specific order (e.g., gender, country names).
- Ordinal Data – Categorical data with a
defined order (e.g., rankings, customer satisfaction levels).
5. Based on Sensitivity:
- Public Data – Available to anyone (e.g.,
government statistics, published research).
- Private Data – Restricted to specific users
(e.g., internal company reports, employee records).
- Sensitive
Data – Requires
protection due to privacy concerns (e.g., financial records, medical
information).
- Personal
Data – Related
to an individual’s identity (e.g., name, address, phone number).






No comments:
Post a Comment