
import re

s = 'My Profile: https://www.geeksforgeeks.org/404.html/ in the portal of https://www.geeksforgeeks.org/'
pattern = r'https?://\S+|www\.\S+'
print("URLs:", re.findall(pattern, s))

Output

URLs: ['https://www.geeksforgeeks.org/404.html/', 'https://www.geeksforgeeks.org/']

Explanation:

r'https?://\S+|www\.\S+' is a regex pattern to match URLs starting with http://, https://, or www.
re.findall() returns all non-overlapping matches as a list.
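
Because \S+ matches every non-whitespace character, punctuation that follows a URL in a sentence (a closing period, comma, or parenthesis) ends up inside the match. The following is a minimal sketch of one way to trim it. The helper name extract_urls and the set of characters in TRAILING are assumptions made for illustration, and the example.com address is a placeholder.

```python
import re

URL_PATTERN = re.compile(r'https?://\S+|www\.\S+')

# Characters treated as sentence punctuation rather than part of a URL
# (an assumption for this sketch; adjust for your data).
TRAILING = '.,;:!?)\'"'

def extract_urls(text):
    """Return URLs found in text with trailing punctuation removed."""
    return [match.rstrip(TRAILING) for match in URL_PATTERN.findall(text)]

text = 'Docs are at https://www.geeksforgeeks.org/, see also (www.example.com).'
print(extract_urls(text))
# ['https://www.geeksforgeeks.org/', 'www.example.com']
```

Stripping is a heuristic: a URL that legitimately ends in one of these characters would be shortened too.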

Using urlparse()

urlparse() function from Python's urllib.parse module helps break down a URL into its key parts, such as the scheme (http, https), domain name, path, query parameters, and fragments. This function is useful for validating and extracting URLs from text by checking if a word follows a proper URL structure.

from urllib.parse import urlparse

s = 'My Profile: https://www.geeksforgeeks.org/404.html/ in the portal of https://www.geeksforgeeks.org/'

# Split the string into words
split_s = s.split()

# Empty list to collect URLs
urls = []
for word in split_s:
    parsed = urlparse(word)
    # Keep the word only if it has both a scheme and a network location
    if parsed.scheme and parsed.netloc:
        urls.append(word)

print("URLs:", urls)

Output

URLs: ['https://www.geeksforgeeks.org/404.html/', 'https://www.geeksforgeeks.org/']

Explanation:

s.split() splits the string into words.
urlparse(word) parses each word; the word is kept only if both its scheme (such as http or https) and its network location (the domain) are non-empty.
Matching words are added to the urls list using append().
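
The check above accepts any word with a non-empty scheme and network location, so a word such as ftp://example.com/file would pass as well. The sketch below first prints the components that urlparse() returns and then applies a stricter filter. It assumes only http and https links are wanted; the helper name is_http_url and the example.com addresses are placeholders for illustration.

```python
from urllib.parse import urlparse

# Components of a parsed URL
parts = urlparse('https://www.example.com/search?q=python#top')
print(parts.scheme)    # https
print(parts.netloc)    # www.example.com
print(parts.path)      # /search
print(parts.query)     # q=python
print(parts.fragment)  # top

def is_http_url(word):
    """True if word parses with an http(s) scheme and a non-empty host."""
    parsed = urlparse(word)
    return parsed.scheme in ('http', 'https') and bool(parsed.netloc)

words = 'Mirror: ftp://example.com/file and https://www.geeksforgeeks.org/'.split()
print([w for w in words if is_http_url(w)])
# ['https://www.geeksforgeeks.org/']
```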

Using urlextract()

urlextract is a third-party library, so install it first by running "pip install urlextract" in your terminal. It offers a pre-built solution for finding URLs in text: its URLExtract class identifies URLs without custom patterns, which makes it a convenient choice when URLs are hard to extract by hand.

from urlextract import URLExtract

s = 'My Profile: https://www.geeksforgeeks.org/user/Prajjwal%20/contributions/ in the portal of https://www.geeksforgeeks.org/'

extractor = URLExtract()
urls = extractor.find_urls(s)
print("URLs:", urls)

Output

URLs: ['https://www.geeksforgeeks.org/user/Prajjwal%20/contributions/', 'https://www.geeksforgeeks.org/']

Explanation:

URLExtract is imported from the urlextract library.
URLExtract() creates an extractor object to scan the string.
find_urls() detects all URLs in s and returns them as a list, so no manual splitting or validation is needed.

Using startswith()

A simple approach is to split the string with .split() and check whether each word starts with "http://" or "https://" using the built-in .startswith() method. Each word that does is added to the list of extracted URLs.

s = 'My Profile: https://www.geeksforgeeks.org/404.html/ in the portal of https://www.geeksforgeeks.org/'
x = s.split()

# Empty list to collect the URLs
res = []
for i in x:
    # startswith() accepts a tuple of prefixes to test
    if i.startswith(("http://", "https://")):
        res.append(i)

print("Urls: ", res)

Output

Urls:  ['https://www.geeksforgeeks.org/404.html/', 'https://www.geeksforgeeks.org/']

Explanation:

s.split() splits the string into words.
The if statement checks whether each word starts with http:// or https://.
If it does, the word is added to the list of URLs using the .append() method.
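
A plain prefix check misses URLs that are wrapped in punctuation, such as "(https://...)", or written with an uppercase scheme. The sketch below shows one way to loosen it. The helper name urls_by_prefix, the set of stripped characters, and the example.com address are assumptions for illustration, not part of the approach above.

```python
PREFIXES = ('http://', 'https://')

def urls_by_prefix(text):
    """Collect whitespace-separated words that begin with an http(s) prefix."""
    found = []
    for word in text.split():
        # Drop wrapping punctuation such as "(" or "," before testing the prefix.
        candidate = word.strip('()[]<>,;"\'')
        # Compare in lowercase so "HTTPS://..." is also recognised.
        if candidate.lower().startswith(PREFIXES):
            found.append(candidate)
    return found

print(urls_by_prefix('See (https://www.geeksforgeeks.org/) or HTTP://example.com, thanks'))
# ['https://www.geeksforgeeks.org/', 'HTTP://example.com']
```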

Using find() method

find() is a built-in string method that returns the index of the first occurrence of a substring, or -1 if the substring is not present. A word that starts with a URL scheme returns index 0, so we can use find() to identify and extract URLs from a string. Here's how:

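s = 'My Profile: https://www.geeksforgeeks.org/404.html/ in the portal of https://www.geeksforgeeks.org/'
split_s = s.split()

# Empty list to collect the URLs
res = []
for i in split_s:
    # find() returns 0 when the prefix sits at the very start of the word
    if i.find("https://") == 0 or i.find("http://") == 0:
        res.append(i)

print("Urls: ", res)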

Output

Urls:  ['https://www.geeksforgeeks.org/404.html/', 'https://www.geeksforgeeks.org/']

Explanation:

s.split() splits the string into words.
i.find() returns 0 when a word starts with the http or https prefix, which identifies it as a URL.
Matching words are added to the list res using .append().
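
The comparison with 0 matters because find() reports where the substring occurs, not merely whether it occurs. A short sketch of the three possible outcomes follows; the helper name has_url_prefix and the example.com address are placeholders for illustration.

```python
word = 'https://www.geeksforgeeks.org/'

print(word.find('https://'))                       # 0: prefix at the start
print('see:https://example.com'.find('https://'))  # 4: found, but not at the start
print('portal'.find('https://'))                   # -1: not found

def has_url_prefix(word):
    """True when an http(s) prefix sits at index 0 of word."""
    return word.find('http://') == 0 or word.find('https://') == 0

print(has_url_prefix(word))                       # True
print(has_url_prefix('see:https://example.com'))  # False
```

Note that find() searches the whole word before returning -1, whereas startswith() only inspects the beginning, so startswith() is usually the more direct choice for a prefix test.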
