Applied Metaphors: Learning TRIZ, Complexity, Data/Stats/ML using Metaphors
  1. Teaching
  2. Data Science with No Code
  3. 🕶 Happy Data are all Alike
  • Teaching
    • Data Analytics for Managers and Creators
      • Tools
        • Introduction to R and RStudio
        • Introduction to Radiant
        • Introduction to Orange
      • Descriptive Analytics
        • Data
        • Summaries
        • Counts
        • Quantities
        • Groups
        • Densities
        • Groups and Densities
        • Change
        • Proportions
        • Parts of a Whole
        • Evolution and Flow
        • Ratings and Rankings
        • Surveys
        • Time
        • Space
        • Networks
        • Experiments
        • Miscellaneous Graphing Tools, and References
      • Statistical Inference
        • 🧭 Basics of Statistical Inference
        • 🎲 Samples, Populations, Statistics and Inference
        • Basics of Randomization Tests
        • 🃏 Inference for a Single Mean
        • 🃏 Inference for Two Independent Means
        • 🃏 Inference for Comparing Two Paired Means
        • Comparing Multiple Means with ANOVA
        • Inference for Correlation
        • 🃏 Testing a Single Proportion
        • 🃏 Inference Test for Two Proportions
      • Inferential Modelling
        • Modelling with Linear Regression
        • Modelling with Logistic Regression
        • 🕔 Modelling and Predicting Time Series
      • Predictive Modelling
        • 🐉 Intro to Orange
        • ML - Regression
        • ML - Classification
        • ML - Clustering
      • Prescriptive Modelling
        • 📐 Intro to Linear Programming
        • 💭 The Simplex Method - Intuitively
        • 📅 The Simplex Method - In Excel
      • Workflow
        • Facing the Abyss
        • I Publish, therefore I Am
      • Case Studies
        • Demo:Product Packaging and Elderly People
        • Ikea Furniture
        • Movie Profits
        • Gender at the Work Place
        • Heptathlon
        • School Scores
        • Children's Games
        • Valentine’s Day Spending
        • Women Live Longer?
        • Hearing Loss in Children
        • California Transit Payments
        • Seaweed Nutrients
        • Coffee Flavours
        • Legionnaire’s Disease in the USA
        • Antarctic Sea ice
        • William Farr's Observations on Cholera in London
    • R for Artists and Managers
      • 🕶 Lab-1: Science, Human Experience, Experiments, and Data
      • Lab-2: Down the R-abbit Hole…
      • Lab-3: Drink Me!
      • Lab-4: I say what I mean and I mean what I say
      • Lab-5: Twas brillig, and the slithy toves…
      • Lab-6: These Roses have been Painted !!
      • Lab-7: The Lobster Quadrille
      • Lab-8: Did you ever see such a thing as a drawing of a muchness?
      • Lab-9: If you please sir…which way to the Secret Garden?
      • Lab-10: An Invitation from the Queen…to play Croquet
      • Lab-11: The Queen of Hearts, She Made some Tarts
      • Lab-12: Time is a Him!!
      • Iteration: Learning to purrr
      • Lab-13: Old Tortoise Taught Us
      • Lab-14: You’re are Nothing but a Pack of Cards!!
    • ML for Artists and Managers
      • 🐉 Intro to Orange
      • ML - Regression
      • ML - Classification
      • ML - Clustering
      • 🕔 Modelling Time Series
    • TRIZ for Problem Solvers
      • I am Water
      • I am What I yam
      • Birds of Different Feathers
      • I Connect therefore I am
      • I Think, Therefore I am
      • The Art of Parallel Thinking
      • A Year of Metaphoric Thinking
      • TRIZ - Problems and Contradictions
      • TRIZ - The Unreasonable Effectiveness of Available Resources
      • TRIZ - The Ideal Final Result
      • TRIZ - A Contradictory Language
      • TRIZ - The Contradiction Matrix Workflow
      • TRIZ - The Laws of Evolution
      • TRIZ - Substance Field Analysis, and ARIZ
    • Math Models for Creative Coders
      • Maths Basics
        • Vectors
        • Matrix Algebra Whirlwind Tour
        • content/courses/MathModelsDesign/Modules/05-Maths/70-MultiDimensionGeometry/index.qmd
      • Tech
        • Tools and Installation
        • Adding Libraries to p5.js
        • Using Constructor Objects in p5.js
      • Geometry
        • Circles
        • Complex Numbers
        • Fractals
        • Affine Transformation Fractals
        • L-Systems
        • Kolams and Lusona
      • Media
        • Fourier Series
        • Additive Sound Synthesis
        • Making Noise Predictably
        • The Karplus-Strong Guitar Algorithm
      • AI
        • Working with Neural Nets
        • The Perceptron
        • The Multilayer Perceptron
        • MLPs and Backpropagation
        • Gradient Descent
      • Projects
        • Projects
    • Data Science with No Code
      • Data
      • Orange
      • Summaries
      • Counts
      • Quantity
      • 🕶 Happy Data are all Alike
      • Groups
      • Change
      • Rhythm
      • Proportions
      • Flow
      • Structure
      • Ranking
      • Space
      • Time
      • Networks
      • Surveys
      • Experiments
    • Tech for Creative Education
      • 🧭 Using Idyll
      • 🧭 Using Apparatus
      • 🧭 Using g9.js
    • Literary Jukebox: In Short, the World
      • Italy - Dino Buzzati
      • France - Guy de Maupassant
      • Japan - Hisaye Yamamoto
      • Peru - Ventura Garcia Calderon
      • Russia - Maxim Gorky
      • Egypt - Alifa Rifaat
      • Brazil - Clarice Lispector
      • England - V S Pritchett
      • Russia - Ivan Bunin
      • Czechia - Milan Kundera
      • Sweden - Lars Gustaffsson
      • Canada - John Cheever
      • Ireland - William Trevor
      • USA - Raymond Carver
      • Italy - Primo Levi
      • India - Ruth Prawer Jhabvala
      • USA - Carson McCullers
      • Zimbabwe - Petina Gappah
      • India - Bharati Mukherjee
      • USA - Lucia Berlin
      • USA - Grace Paley
      • England - Angela Carter
      • USA - Kurt Vonnegut
      • Spain-Merce Rodoreda
      • Israel - Ruth Calderon
      • Israel - Etgar Keret
  • Posts
  • Blogs and Talks

On this page

  • What graphs will we see today?
    • How do these Work?
  • Bar and Column Charts in RAWgraphs
  • Dataset: Netflix Original Series
    • Examine the Data
    • Research Questions
    • Plotting a Bar Chart
    • What is the Story Here?
  • Dataset: Banned Books in the USA
    • What is the Story Here?
  • Frequency Distributions
  • 2D Frequency Distributions and Hexbin plots
    • What is the Story here?
  • An Example: Frequency Density
    • How does this work?
    • How could you explore?
    • What is the Story here?
  • Your Turn
  • Fun Stuff
  1. Teaching
  2. Data Science with No Code
  3. 🕶 Happy Data are all Alike

🕶 Happy Data are all Alike

Graphs for Single Quant Variables

Modified

March 17, 2025

What graphs will we see today?

Some of the very basic and commonly used plots for data are:
- Bar and Column Charts
- Histograms and Frequency Distributions
- Scatter Plots (if there is more than one quant variable) and
- 2D Hexbins Plots and 2D Frequency Distributions (horrors!!)

How do these Work?

Histograms are best to show the distribution of raw quantitative data, by displaying the number of values that fall within defined ranges, often called buckets or bins.

Although histograms may look similar to bar/column charts, the two are different. First, histograms show continuous data, and usually you can adjust the bucket ranges to explore frequency patterns. For example, you can shift histogram buckets from 0-1, 1-2, 2-3, etc. to 0-2, 2-4, etc.

By contrast, bar/column charts show categorical data, such as the number of apples, bananas, carrots, etc. Second, histograms do not usually show spaces between buckets because these are continuous values, while column charts show spaces to separate each category.

Bar and Column Charts in RAWgraphs

Dataset: Netflix Original Series

Here is a look at this data. It is readily available as on the in-bult datasets at RAWgraphs.

Examine the Data

name class levels n missing distribution
Genere character 10 109 0 Family Animation (29.4%) ...
Title character 109 109 0 0.03 (0.9%), 13 Reasons Why (0.9%) ...
Subgenre character 42 109 0 Animation (30.3%), Comedy (7.3%) ...
Status character 4 106 3 Pending (41.5%), Renewed (34.9%) ...
name class min Q1 median Q3 max mean sd n missing
Premiere_Year numeric 2013 2015 2016 2016 2017 2015.697248 1.101384 109 0
Seasons numeric 1 1 1 2 5 1.642202 1.041037 109 0
Episodes numeric 3 8 13 21 90 17.871560 15.671454 109 0
IMDB_Rating numeric 0 70 77 84 96 73.559633 16.966978 109 0
NoteQuantitative Data
  • Premiere_Year: Year the movie premiered
  • Seasons: No. of Seasons
  • Episodes: No. of Episodes
  • IMDB_Rating: IMDB Rating!!
NoteQualitative Data
  • Genere: 10 types of Genres
  • Title: 109 titles
  • Subgenre: 42 types of sub-Genres
  • Status: 4 levels, status on Netflix

Research Questions

Let’s try a few questions and see if they are answerable with Bar Charts. Recall that Bar Charts show counts of Qualitative variables!

Q1. How many movies of each Genere? Sort them by Genere! Q2. Which Genere has the highest average IMDB_Rating? Some grouping + aggregating needed here!

Plotting a Bar Chart

Let us create this figure:

What is the Story Here?

  • Talk Shows?? What??

Dataset: Banned Books in the USA

Here is a dataset from Jeremy Singer-Vine’s blog, Data Is Plural. This is a list of all books banned in schools across the US.

Download this data to your machine and use it on RAWGraphs.

What is the Story Here?

Frequency Distributions

2D Frequency Distributions and Hexbin plots

What is the Story here?

TBD

An Example: Frequency Density

How does this work?

Let us listen to the late great Hans Rosling from the Gapminder Project, which aims at telling stories of the world with data, to remove systemic biases about poverty, income and gender related issues.

How many are rich and how many are poor? from Gapminder on Vimeo.

How could you explore?

TBD. Add 2D contour plots and link up to hexbin plots.

What is the Story here?

Your Turn

  1. Rbnb Price Data on the French Riviera: ::: {.cell} ::: {.cell-output-display}

::: :::

  1. Apartment price vs ground living area: ::: {.cell} ::: {.cell-output-display}

::: ::: (Try a Scatter Plot too, since we have two Quant variables)

  1. Rbnb Price Data on the French Riviera: ::: {.cell} ::: {.cell-output-display}

::: :::

  1. India
  2. Old Faithful Data
  3. Income data
  4. Diamonds Data from R
  5. calmcode.io dataset

Fun Stuff

  1. See the scrolly animation for a histogram at this website: Exploring Histograms, an essay by Aran Lunzer and Amelia McNamara https://tinlizzie.org/histograms/?s=09
Back to top
Quantity
Groups

License: CC BY-SA 2.0

Website made with ❤️ and Quarto, by Arvind V.

Hosted by Netlify .