Web Scraping Databases with Mechanical Soup and SQlite

Hi Everyone! In this step by step tutorial, we will extract a huge table of data from the internet and store it inside an SQLite database! To keep things simple I've chosen a Wikipedia table, but I highly encourage you to apply the same principles on data that updates a bit more frequently (for example weather forecasts) 😃 If you're curious about my IDE - I'm using Wayscript which is now available for the wide public! you no longer need an invitation, you can simply sign up with the following link: https://app.wayscript.com ⭐clone complete tutorial code⭐ https://app.wayscript.com/lairs/517c9... ā° TIMESTAMPS ā° *************************************** 00:00 - intro 00:34 - imports and installs 01:42 - web scraping with mechanical soup 02:20 - select HTML table elements 03:47 - extract element attributes 06:11 - find the index value of a list item 07:13 - extract multiple columns of table data 09:44 - organize extracted columns 12:44 - enumerate function 14:02 - dictionary to data frame 14:53 - create SQLite database 15:36 - create SQLite table 16:35 - insert Pandas data frame into SQlite table 17:26 - save data permanently inside database file 18:49 - thanks for watching! šŸ’» CODE AND IMPORTANT LINKS šŸ’» *************************************** ⭐ URL used in the tutorial: https://en.wikipedia.org/wiki/Compari... ⭐ complete code repository on Github: https://github.com/MariyaSha/Webscrap... ⭐install SQLite on Linux: sudo apt install sqlite3 ⭐install SQLite on Windows: Download the Precompiled Binaries for Windows zip file from SQLite docs: https://www.sqlite.org/download.html ⭐install SQLite on MAC or Anaconda: no need to install - you already have it! 😁 ⭐ code used in the tutorial: column_names = ["Founder", "Maintainer", "Initial_Release_Year", "Current_Stable_Version", "Security_Updates", "Release_Date", "System_Distribution_Commitment", "Forked_From", "Target_Audience", "Cost", "Status"] šŸ“½ļø RELATED TUTORIALSšŸ“½ļø *************************************** šŸŒž Much Better HTML table Web Scraping with Pandas:    • MuchĀ BetterĀ WebĀ ScrapingĀ withĀ PandasĀ -Ā Aut...Ā Ā  šŸŒž SQLite Databases for Beginners:    • SQLiteĀ BackendĀ forĀ BeginnersĀ -Ā CreateĀ Quic...Ā Ā  šŸŒž Web Scraping Images with Mechanical Soup:    • BetterĀ WebĀ ScrapingĀ withĀ MechanicalĀ SoupĀ -...Ā Ā  šŸŒž Web Scraping Text with Beautiful Soup:    • WebĀ ScrapingĀ withĀ BeautifulĀ SoupĀ -Ā MakeĀ Da...Ā Ā 

Plotting Google Trends Graphs with Matplotlib & Pandas
ā–¶ļøŽ

Plotting Google Trends Graphs with Matplotlib & Pandas

Basic Guide to Pandas! Tricks, Shortcuts, Must Know Commands! Python for Beginners
ā–¶ļøŽ

Basic Guide to Pandas! Tricks, Shortcuts, Must Know Commands! Python for Beginners

Playwright Web Scraping + CAPTCHA Bypass Tutorial
ā–¶ļøŽ

Playwright Web Scraping + CAPTCHA Bypass Tutorial

SQLite Advanced Tutorial | Short Course with Examples!
ā–¶ļøŽ

SQLite Advanced Tutorial | Short Course with Examples!

Billionaire's WARNING: I'm SELLING. The Crash Is Already Here!
ā–¶ļøŽ

Billionaire's WARNING: I'm SELLING. The Crash Is Already Here!

System Design Explained: APIs, Databases, Caching, CDNs, Load Balancing & Production Infra
ā–¶ļøŽ

System Design Explained: APIs, Databases, Caching, CDNs, Load Balancing & Production Infra

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
ā–¶ļøŽ

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Using Large Language Models | Build Your Own LLM Workshop #1
ā–¶ļøŽ

Using Large Language Models | Build Your Own LLM Workshop #1

List Comprehension - BEST Python feature !!! Fast and Efficient
ā–¶ļøŽ

List Comprehension - BEST Python feature !!! Fast and Efficient

Create GUI App with Tkinter and SQLite - Step by Step Python Tutorial for Beginners
ā–¶ļøŽ

Create GUI App with Tkinter and SQLite - Step by Step Python Tutorial for Beginners

šŸ­šŸµšŸµšŸµ š——š—„š—œš—™š—§ š—œš—”š—§š—¢ š— š—˜š— š—¢š—„š—¬ // Synthwave, Vaporwave, Cyberpunk, Chillwave, Retrowave, Dreamwave Playlist
ā–¶ļøŽ

šŸ­šŸµšŸµšŸµ š——š—„š—œš—™š—§ š—œš—”š—§š—¢ š— š—˜š— š—¢š—„š—¬ // Synthwave, Vaporwave, Cyberpunk, Chillwave, Retrowave, Dreamwave Playlist

Stop Sharing API Keys with LLMs - Use Docker MCP Catalog Instead! šŸ¤–šŸ’”
ā–¶ļøŽ

Stop Sharing API Keys with LLMs - Use Docker MCP Catalog Instead! šŸ¤–šŸ’”

Advanced Web Scraping Tutorial! (w/ Python Beautiful Soup Library)
ā–¶ļøŽ

Advanced Web Scraping Tutorial! (w/ Python Beautiful Soup Library)

JANITOR vs THE BIGGEST GUYS IN THE GYM. They Didn’t Expect THAT
ā–¶ļøŽ

JANITOR vs THE BIGGEST GUYS IN THE GYM. They Didn’t Expect THAT

SQLite Backend for Beginners - Create Quick Databases with Python and SQL
ā–¶ļøŽ

SQLite Backend for Beginners - Create Quick Databases with Python and SQL

Super-KI? Die große Lüge der Tech-Konzerne
ā–¶ļøŽ

Super-KI? Die große Lüge der Tech-Konzerne

Automate with Python – Full Course for Beginners
ā–¶ļøŽ

Automate with Python – Full Course for Beginners

The Hard Fall of Porsche
ā–¶ļøŽ

The Hard Fall of Porsche

How AI agents & Claude skills work (Clearly Explained)
ā–¶ļøŽ

How AI agents & Claude skills work (Clearly Explained)

D. Richard Hipp - SQLite [The Databaseology Lectures - CMU Fall 2015]
ā–¶ļøŽ

D. Richard Hipp - SQLite [The Databaseology Lectures - CMU Fall 2015]