Skip to content

Practice Probs

Similar Names Problem | Distance Metrics - Metrics

Practice Probs

Home
Pricing
Promote
Contributors
Contact
Blog
Blog
- Archive
  Archive
  - 2022
- Categories
  Categories
Tags
Legal
Legal
Chrome Extensions
Chrome Extensions
- Rickroller
  Rickroller
  - Solution
- Guess a Number
  Guess a Number
  - Solution
- Find and Replace
  Find and Replace
  - Solution
- StackOverflow Search
  StackOverflow Search
  - Solution
- To Do List
  To Do List
  - Solution
Git
Git
- Beginner
  Beginner
  - Softball
    
    Softball
    
    Solution
  - First Commit
    
    First Commit
    
    Solution
  - Woof
    
    Woof
    
    Solution
  - Beignet
    
    Beignet
    
    Solution
  - Who are you?
    
    Who are you?
    
    Solution
  - Who am I?
    
    Who am I?
    
    Solution
  - Snickerdoodle
    
    Snickerdoodle
    
    Solution
  - Ow
    
    Ow
    
    Solution
  - Hot Bod
    
    Hot Bod
    
    Solution
- Intermediate
  Intermediate
  - UFO Sightings
    
    UFO Sightings
    
    Solution
  - Pizza
    
    Pizza
    
    Solution
  - Copy Cat
    
    Copy Cat
    
    Solution
  - Haiku
    
    Haiku
    
    Solution
  - Whoopsies
    
    Whoopsies
    
    Solution
  - Board Games
    
    Board Games
    
    Solution
- Remotes
  Remotes
  - Fry Quotes
    
    Fry Quotes
    
    Solution
Google BigQuery
Google BigQuery
- Beginner
  Beginner
  - Austin Bikeshare Stats
    
    Austin Bikeshare Stats
    
    Solution
  - Unfair Venues
    
    Unfair Venues
    
    Solution
  - Super Sports
    
    Super Sports
    
    Solution
  - Unsold Inventory
    
    Unsold Inventory
    
    Solution
  - Woodcreek College Exams
    
    Woodcreek College Exams
    
    Solution
- Intermediate
  Intermediate
  - Trustees of Princeton University
    
    Trustees of Princeton University
    
    Solution
  - Fruit Prices
    
    Fruit Prices
    
    Solution
  - Bitcoin
    
    Bitcoin
    
    Solution
  - Lord of the Fries
    
    Lord of the Fries
    
    Solution
  - Woodcreek College Exam Results
    
    Woodcreek College Exam Results
    
    Solution
- Advanced
  Advanced
  - Iowa Liquor
    
    Iowa Liquor
    
    Solution
  - Abysmal Grades
    
    Abysmal Grades
    
    Solution
  - Don't Weight
    
    Don't Weight
    
    Solution
  - Vet Visits
    
    Vet Visits
    
    Solution
  - Thief of Catan
    
    Thief of Catan
    
    Solution
  - Sniffed It
    
    Sniffed It
    
    Solution
Matplotlib
Matplotlib
- Beginner
  Beginner
  - Dates vs Derivatives
    
    Dates vs Derivatives
    
    Solution
  - Koala Speeds
    
    Koala Speeds
    
    Solution
  - Jerky For Dogs
    
    Jerky For Dogs
    
    Solution
  - You're a Legend
    
    You're a Legend
    
    Solution
  - Drive-Thru Daiquiris
    
    Drive-Thru Daiquiris
    
    Solution
  - Seven
    
    Seven
    
    Solution
- Intermediate
  Intermediate
  - Iris
    
    Iris
    
    Solution
Metrics
Metrics
- Precision And Recall
  Precision And Recall
  - Plant Eater
    
    Plant Eater
    
    Solution
  - Perfect Brownies
    
    Perfect Brownies
    
    Solution
  - Only Child
    
    Only Child
    
    Solution
  - A-Eye
    
    A-Eye
    
    Solution
  - Misconception
    
    Misconception
    
    Solution
- Correlation Metrics
  Correlation Metrics
  - Grading Graders
    
    Grading Graders
    
    Solution
- Distance Metrics
  Distance Metrics
  - Similar Names
    
    Similar Names
    
    Solution
pytest
pytest
- Beginner
  Beginner
  - High Five
    
    High Five
    
    Solution
  - VIN Discrepancies
    
    VIN Discrepancies
    
    Solution
  - Scrabble
    
    Scrabble
    
    Solution
  - Dog Costumes
    
    Dog Costumes
    
    Solution
PyTorch
PyTorch
- Tensors
  Tensors
  - Random Block
    
    Random Block
    
    Solution
  - Fliptate
    
    Fliptate
    
    Solution
  - Vanilla Neural Network
    
    Vanilla Neural Network
    
    Solution
  - Screen Time
    
    Screen Time
    
    Solution
- Basic Models
  Basic Models
  - Pass Or Fail
    
    Pass Or Fail
    
    Solution
Next.js
Next.js
- Dunder Mifflin
  Dunder Mifflin
  - Pages
    
    Pages
    
    Solution
  - Nav and Footer
    
    Nav and Footer
    
    Solution
  - Fetch Data
    
    Fetch Data
    
    Solution
  - Dynamic Routes
    
    Dynamic Routes
    
    Solution
  - Loading UI
    
    Loading UI
    
    Solution
  - Not Found
    
    Not Found
    
    Solution
  - Styles
    
    Styles
    
    Solution
  - Images
    
    Images
    
    Solution
- Pro Jokes
  Pro Jokes
  - Scaffolding
    
    Scaffolding
    
    Solution
  - Controlled Inputs
    
    Controlled Inputs
    
    Solution
  - Firebase App Setup
    
    Firebase App Setup
    
    Solution
  - Email and Password Registration
    
    Email and Password Registration
    
    Solution
  - Email and Password Authentication
    
    Email and Password Authentication
    
    Solution
  - Display Logged In User
    
    Display Logged In User
    
    Solution
  - Sign Out
    
    Sign Out
    
    Solution
  - Forgot Password
    
    Forgot Password
    
    Solution
  - Google OAuth
    
    Google OAuth
    
    Solution
  - Email Verification
    
    Email Verification
    
    Solution
  - Extra User Data
    
    Extra User Data
    
    Solution
  - Protect Content
    
    Protect Content
    
    Solution
Python Pandas
Python Pandas
- Series
  Series
  - Baby Names
    
    Baby Names
    
    Solution
  - Bees Knees
    
    Bees Knees
    
    Solution
  - Car Shopping
    
    Car Shopping
    
    Solution
  - Price Gouging
    
    Price Gouging
    
    Solution
  - Fair Teams
    
    Fair Teams
    
    Solution
- DataFrame
  DataFrame
  - Hobbies
    
    Hobbies
    
    Solution
  - Party Time
    
    Party Time
    
    Solution
  - Vending Machines
    
    Vending Machines
    
    Solution
  - Cradle Robbers
    
    Cradle Robbers
    
    Solution
  - Potholes
    
    Potholes
    
    Solution
  - AFOLs
    
    AFOLs
    
    Solution
  - Humans
    
    Humans
    
    Solution
- Advanced
  Advanced
  - Class Transitions
    
    Class Transitions
    
    Solution
  - Rose Thorn
    
    Rose Thorn
    
    Solution
  - Product Volumes
    
    Product Volumes
    
    Solution
  - Session Groups
    
    Session Groups
    
    Solution
  - OB-Gym
    
    OB-Gym
    
    Solution
- Final Boss
  Final Boss
  - COVID Tracing
    
    COVID Tracing
    
    Solution
  - Pickle
    
    Pickle
    
    Solution
  - TV Commercials
    
    TV Commercials
    
    Solution
  - Family IQ
    
    Family IQ
    
    Solution
  - Concerts
    
    Concerts
    
    Solution
Python NumPy
Python NumPy
- Beginner
  Beginner
  - High School Reunion
    
    High School Reunion
    
    Solution
  - Nola
    
    Nola
    
    Solution
  - Gold Miner
    
    Gold Miner
    
    Solution
  - Roux
    
    Roux
    
    Solution
  - Chic-fil-A
    
    Chic-fil-A
    
    Solution
- Intermediate
  Intermediate
  - Love Distance
    
    Love Distance
    
    Solution
  - Professor Prick
    
    Professor Prick
    
    Solution
  - Psycho Parent
    
    Psycho Parent
    
    Solution
- Proficient
  Proficient
  - Movie Ratings
    
    Movie Ratings
    
    Solution
  - Big Fish
    
    Big Fish
    
    Solution
  - Taco Truck
    
    Taco Truck
    
    Solution
  - Defraud The Investors
    
    Defraud The Investors
    
    Solution
  - Pixel Artist
    
    Pixel Artist
    
    Solution
- Advanced
  Advanced
  - Population Verification
    
    Population Verification
    
    Solution
  - Prime Locations
    
    Prime Locations
    
    Solution
  - The Game of Doors
    
    The Game of Doors
    
    Solution
  - Peanut Butter
    
    Peanut Butter
    
    Solution
  - Freckle
    
    Freckle
    
    Solution
  - Get Rich Bot
    
    Get Rich Bot
    
    Solution
- Expert
  Expert
  - One-Hot-Encoding
    
    One-Hot-Encoding
    
    Solution
  - Cumulative Rainfall
    
    Cumulative Rainfall
    
    Solution
  - Table Tennis
    
    Table Tennis
    
    Solution
  - Where's Waldo
    
    Where's Waldo
    
    Solution
  - Outer Product
    
    Outer Product
    
    Solution
  - Neural Network Convolution
    
    Neural Network Convolution
    
    Solution
Python Sparse Matrices
Python Sparse Matrices
- Tinder Coach
  Tinder Coach
  - Solution
- Movie Distance
  Movie Distance
  - Solution
- Mine Field
  Mine Field
  - Solution
- Dog Shelter
  Dog Shelter
  - Solution
- Puny Computer
  Puny Computer
  - Solution
Redis
Redis
- Ticket King
  Ticket King
  - Solution
- Beach Volleyball Scores
  Beach Volleyball Scores
  - Solution
Regular Expressions In Python
Regular Expressions In Python
- Beginner
  Beginner
  - Gone with the Wind
    
    Gone with the Wind
    
    Solution
  - Star Wars
    
    Star Wars
    
    Solution
  - The Simpsons
    
    The Simpsons
    
    Solution
  - Spongebob SquarePants
    
    Spongebob SquarePants
    
    Solution
  - Harry Potter
    
    Harry Potter
    
    Solution
- Intermediate
  Intermediate
  - Legally Blonde
    
    Legally Blonde
    
    Solution
  - It's Always Sunny
    
    It's Always Sunny
    
    Solution
  - Napoleon Dynamite
    
    Napoleon Dynamite
    
    Solution
- Advanced
  Advanced
  - The Dark Knight
    
    The Dark Knight
    
    Solution
  - Mean Girls
    
    Mean Girls
    
    Solution
  - Lord of The Rings
    
    Lord of The Rings
    
    Solution
  - ET
    
    ET
    
    Solution
  - Groundhog Day
    
    Groundhog Day
    
    Solution
  - Iron Man
    
    Iron Man
    
    Solution
  - The Big Bang Theory
    
    The Big Bang Theory
    
    Solution
  - Avengers Infinity War
    
    Avengers Infinity War
    
    Solution
  - The Matrix
    
    The Matrix
    
    Solution
  - X-Men
    
    X-Men
    
    Solution
Selenium With Python
Selenium With Python
- Beginner
  Beginner
  - Making Headlines
    
    Making Headlines
    
    Solution
  - Search Struggles
    
    Search Struggles
    
    Solution
  - Tabular Troubles
    
    Tabular Troubles
    
    Solution
  - Headless Photographer
    
    Headless Photographer
    
    Solution
- Intermediate
  Intermediate
  - Form Frenzy
    
    Form Frenzy
    
    Solution
  - Expand The Album
    
    Expand The Album
    
    Solution
YouTube Data API
YouTube Data API
- MrBeast
  MrBeast
  - Solution
- Smarter Every Day
  Smarter Every Day
  - Solution
- 3Blue1Brown
  3Blue1Brown
  - Solution
- Tom Scott
  Tom Scott
  - Solution
- Mark Rober
  Mark Rober
  - Solution
- New York Coffee
  New York Coffee
  - Solution

Similar Names¶

Here's a CSV file with 1,000 distinct U.S. baby names (all lowercase).

babynames_1000.csv

   1:   aaden
   2: aaliyah
   3:    abby
   4:    abel
   5: abigail
  ---        
 996:  zander
 997:    zane
 998:    zara
 999:    zion
1000:     zoe

How many distinct (A, B) pairs of names have Levenshtein distance ≤ 3?

Distinct entries

If your result includes (aaden, allen), make sure it doesn't also include (allen, aaden).

Loading the data¶

You can load the data directly from GitHub.

Python PandasR data.table

import pandas as pd
names = pd.read_csv("https://raw.githubusercontent.com/practiceprobs/datasets/main/babynames/babynames_1000.csv")

library(data.table)
names <- fread("https://raw.githubusercontent.com/practiceprobs/datasets/main/babynames/babynames_1000.csv")