What is Data?


1. What is Data?

Data is any collection of facts, values, or measurements that can be recorded, stored, and processed by computers or humans.

  • Simple examples: Numbers (42), words (“hello”), dates (2025-08-06), true/false (yes/no), files, images, videos, etc.
  • In technology: Data is the raw material for software, analytics, machine learning, business intelligence, etc.

2. Types of Data

Data can be classified in several ways. The most common are:

A. By Structure

TypeDescriptionExample
StructuredOrganized in fixed fields/columns, like a tableDatabases, Excel spreadsheets
Semi-StructuredHas some organization but not a strict schemaJSON, XML, CSV, log files
UnstructuredNo pre-defined format or schemaImages, videos, emails, PDFs

B. By Nature/Content

TypeDescriptionExample
NumericNumbers, integers, decimals100, 3.14, -7
TextualWords, sentences, text blocks“Customer name”, “Review: great product”
CategoricalLabels or categoriesRed/Green/Blue, Male/Female
BooleanTrue/False, Yes/No, 0/1true, false, 1, 0
Date/TimeDates, timestamps2025-08-06, 12:30 PM
SpatialLocations, coordinatesGPS points, maps
MultimediaImages, audio, video, graphicsprofile_pic.jpg, song.mp3, video.mp4

C. By How It Arrives

TypeDescriptionExample
BatchCollected and processed in chunksDaily sales report, nightly backups
StreamingArrives and processed in real-timeWebsite clicks, sensor data, live chat

3. Sources of Data

Data can come from almost anywhere. Here are typical sources in tech and business:

SourceDescriptionExample
TransactionalSystems that record daily businessSales databases, banking systems
OperationalLogs/events from running systemsWeb server logs, app logs, error logs
External APIsData from third-party servicesWeather APIs, social media APIs, payment APIs
Manual EntryHuman inputOnline forms, surveys, spreadsheets
IoT/SensorsPhysical devices measuring thingsThermometers, cameras, GPS trackers
Files/BlobsUploaded or shared filesCSVs, Excel, PDFs, videos
Web ScrapingData extracted from websitesProduct prices, news headlines
Public Data SetsOpen government or community dataCensus data, COVID-19 stats, Wikipedia dumps
Streaming ServicesLive feeds, eventsKafka, Kinesis, Event Hubs

Summary Table

TypeExamples
StructuredSQL databases, Excel tables
Semi-StructuredJSON, XML, CSV
UnstructuredImages, emails, audio, PDFs
Numeric1, 2.5, -99
Textual“Hello”, “Feedback”
Categorical“Red”, “Male”, “Success”
Booleantrue/false, 0/1
Date/Time2024-05-23, 17:45
BatchDaily ETL jobs
StreamingReal-time clickstream, IoT
SourcesDatabases, APIs, Logs, Sensors, Files, Scraping

Leave a Comment