Regular Expressions 101

Community Patterns

deviantart scrape search results

Regular Expression
Python

r"
.*category="(.*?)".*<a class="thumb.*?href="http://.*?art\/(.*?)".*?title=".*? by (.*?),.* data-super-img="(.+?)".+</a>
"
g

Description

Scrape the HTML of DeviantArt's search results page to extract the category, ID, user, and image URL of each post.
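
A minimal sketch of how this pattern could be applied with Python's re module. The sample markup, the field names, and the scrape_posts helper are assumptions made for illustration; they are not taken from a real DeviantArt page, whose HTML may look different. The g flag is approximated here with finditer, which returns every non-overlapping match.

```python
import re

# The pattern from this entry, written as a single-quoted raw string because
# it contains double quotes. Without re.DOTALL, "." does not cross newlines,
# so each post has to sit on a single line of HTML for the pattern to match.
PATTERN = re.compile(
    r'.*category="(.*?)".*<a class="thumb.*?href="http://.*?art\/(.*?)"'
    r'.*?title=".*? by (.*?),.* data-super-img="(.+?)".+</a>'
)

# Hypothetical markup, one post per line, shaped only to fit the pattern.
SAMPLE_HTML = "\n".join([
    '<div category="digitalart/paintings"><a class="thumb" '
    'href="http://someuser.deviantart.com/art/Sunset-123456" '
    'title="Sunset by someuser, Jan 1, 2015" '
    'data-super-img="http://example.com/img/sunset.jpg">'
    '<img src="thumb1.jpg"></a></div>',
    '<div category="photography/nature"><a class="thumb" '
    'href="http://otheruser.deviantart.com/art/Forest-Path-789012" '
    'title="Forest Path by otheruser, Feb 2, 2015" '
    'data-super-img="http://example.com/img/forest.jpg">'
    '<img src="thumb2.jpg"></a></div>',
])


def scrape_posts(html):
    """Return one dict per match, keyed by illustrative field names."""
    fields = ("category", "id", "user", "image_url")
    return [dict(zip(fields, m.groups())) for m in PATTERN.finditer(html)]


if __name__ == "__main__":
    for post in scrape_posts(SAMPLE_HTML):
        print(post)
```

Because the pattern relies on several greedy `.*` segments, it is sensitive to attribute order and to line breaks inside a post's markup. For anything beyond a quick extraction, an HTML parser is usually the more robust choice.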

Submitted by nelieru - 9 years ago