{"id":341,"date":"2025-08-06T01:18:47","date_gmt":"2025-08-06T01:18:47","guid":{"rendered":"https:\/\/dataopsschool.com\/blog\/?p=341"},"modified":"2025-08-06T01:18:48","modified_gmt":"2025-08-06T01:18:48","slug":"what-is-data","status":"publish","type":"post","link":"https:\/\/dataopsschool.com\/blog\/what-is-data\/","title":{"rendered":"What is Data?"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>1. What is Data?<\/strong><\/h2>\n\n\n\n<p><strong>Data<\/strong> is any collection of facts, values, or measurements that can be recorded, stored, and processed by computers or humans.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Simple examples:<\/strong> Numbers (42), words (\u201chello\u201d), dates (2025-08-06), true\/false (yes\/no), files, images, videos, etc.<\/li>\n\n\n\n<li><strong>In technology:<\/strong> Data is the raw material for software, analytics, machine learning, business intelligence, etc.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>2. Types of Data<\/strong><\/h2>\n\n\n\n<p><strong>Data can be classified in several ways. The most common are:<\/strong><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>A. By Structure<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Type<\/th><th>Description<\/th><th>Example<\/th><\/tr><\/thead><tbody><tr><td><strong>Structured<\/strong><\/td><td>Organized in fixed fields\/columns, like a table<\/td><td>Databases, Excel spreadsheets<\/td><\/tr><tr><td><strong>Semi-Structured<\/strong><\/td><td>Has some organization but not a strict schema<\/td><td>JSON, XML, CSV, log files<\/td><\/tr><tr><td><strong>Unstructured<\/strong><\/td><td>No pre-defined format or schema<\/td><td>Images, videos, emails, PDFs<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>B. By Nature\/Content<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Type<\/th><th>Description<\/th><th>Example<\/th><\/tr><\/thead><tbody><tr><td><strong>Numeric<\/strong><\/td><td>Numbers, integers, decimals<\/td><td>100, 3.14, -7<\/td><\/tr><tr><td><strong>Textual<\/strong><\/td><td>Words, sentences, text blocks<\/td><td>&#8220;Customer name&#8221;, &#8220;Review: great product&#8221;<\/td><\/tr><tr><td><strong>Categorical<\/strong><\/td><td>Labels or categories<\/td><td>Red\/Green\/Blue, Male\/Female<\/td><\/tr><tr><td><strong>Boolean<\/strong><\/td><td>True\/False, Yes\/No, 0\/1<\/td><td>true, false, 1, 0<\/td><\/tr><tr><td><strong>Date\/Time<\/strong><\/td><td>Dates, timestamps<\/td><td>2025-08-06, 12:30 PM<\/td><\/tr><tr><td><strong>Spatial<\/strong><\/td><td>Locations, coordinates<\/td><td>GPS points, maps<\/td><\/tr><tr><td><strong>Multimedia<\/strong><\/td><td>Images, audio, video, graphics<\/td><td>profile_pic.jpg, song.mp3, video.mp4<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>C. By How It Arrives<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Type<\/th><th>Description<\/th><th>Example<\/th><\/tr><\/thead><tbody><tr><td><strong>Batch<\/strong><\/td><td>Collected and processed in chunks<\/td><td>Daily sales report, nightly backups<\/td><\/tr><tr><td><strong>Streaming<\/strong><\/td><td>Arrives and processed in real-time<\/td><td>Website clicks, sensor data, live chat<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>3. Sources of Data<\/strong><\/h2>\n\n\n\n<p><strong>Data can come from almost anywhere. Here are typical sources in tech and business:<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Source<\/th><th>Description<\/th><th>Example<\/th><\/tr><\/thead><tbody><tr><td><strong>Transactional<\/strong><\/td><td>Systems that record daily business<\/td><td>Sales databases, banking systems<\/td><\/tr><tr><td><strong>Operational<\/strong><\/td><td>Logs\/events from running systems<\/td><td>Web server logs, app logs, error logs<\/td><\/tr><tr><td><strong>External APIs<\/strong><\/td><td>Data from third-party services<\/td><td>Weather APIs, social media APIs, payment APIs<\/td><\/tr><tr><td><strong>Manual Entry<\/strong><\/td><td>Human input<\/td><td>Online forms, surveys, spreadsheets<\/td><\/tr><tr><td><strong>IoT\/Sensors<\/strong><\/td><td>Physical devices measuring things<\/td><td>Thermometers, cameras, GPS trackers<\/td><\/tr><tr><td><strong>Files\/Blobs<\/strong><\/td><td>Uploaded or shared files<\/td><td>CSVs, Excel, PDFs, videos<\/td><\/tr><tr><td><strong>Web Scraping<\/strong><\/td><td>Data extracted from websites<\/td><td>Product prices, news headlines<\/td><\/tr><tr><td><strong>Public Data Sets<\/strong><\/td><td>Open government or community data<\/td><td>Census data, COVID-19 stats, Wikipedia dumps<\/td><\/tr><tr><td><strong>Streaming Services<\/strong><\/td><td>Live feeds, events<\/td><td>Kafka, Kinesis, Event Hubs<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Summary Table<\/strong><\/h2>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th><strong>Type<\/strong><\/th><th><strong>Examples<\/strong><\/th><\/tr><\/thead><tbody><tr><td>Structured<\/td><td>SQL databases, Excel tables<\/td><\/tr><tr><td>Semi-Structured<\/td><td>JSON, XML, CSV<\/td><\/tr><tr><td>Unstructured<\/td><td>Images, emails, audio, PDFs<\/td><\/tr><tr><td>Numeric<\/td><td>1, 2.5, -99<\/td><\/tr><tr><td>Textual<\/td><td>&#8220;Hello&#8221;, &#8220;Feedback&#8221;<\/td><\/tr><tr><td>Categorical<\/td><td>&#8220;Red&#8221;, &#8220;Male&#8221;, &#8220;Success&#8221;<\/td><\/tr><tr><td>Boolean<\/td><td>true\/false, 0\/1<\/td><\/tr><tr><td>Date\/Time<\/td><td>2024-05-23, 17:45<\/td><\/tr><tr><td>Batch<\/td><td>Daily ETL jobs<\/td><\/tr><tr><td>Streaming<\/td><td>Real-time clickstream, IoT<\/td><\/tr><tr><td>Sources<\/td><td>Databases, APIs, Logs, Sensors, Files, Scraping<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>1. What is Data? Data is any collection of facts, values, or measurements that can be recorded, stored, and processed by computers or humans. 2. Types of&#8230; <\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-341","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/posts\/341","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/comments?post=341"}],"version-history":[{"count":1,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/posts\/341\/revisions"}],"predecessor-version":[{"id":342,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/posts\/341\/revisions\/342"}],"wp:attachment":[{"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/media?parent=341"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/categories?post=341"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dataopsschool.com\/blog\/wp-json\/wp\/v2\/tags?post=341"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}