Pymeta – Search The Web For Files On A Domain To Download And Extract Metadata

Last Updated: 28 Mar, 2022

PyMeta is an automated security tool written in Python. Given a target domain, it scrapes Google and Bing search results to find files of the following types: pdf, xls, xlsx, csv, doc, docx, ppt, and pptx. It then downloads those files from the target domain and extracts their metadata. Such files can expose sensitive data, such as credentials or API keys, that should remain private.
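To make the search step more concrete, here is a minimal sketch of how a per-file-type search query could be built for a domain. It assumes the commonly supported `site:` and `filetype:` search operators; the function name `build_queries` is hypothetical, and this is not taken from PyMeta's source code.

```python
# Illustrative sketch only: assumes the `site:` and `filetype:` search
# operators; not PyMeta's actual implementation.
FILE_TYPES = ["pdf", "xls", "xlsx", "csv", "doc", "docx", "ppt", "pptx"]


def build_queries(domain, file_types=FILE_TYPES):
    """Return one search query string per file type for the given domain."""
    return [f"site:{domain} filetype:{ext}" for ext in file_types]


if __name__ == "__main__":
    for query in build_queries("geeksforgeeks.org"):
        print(query)
```

Each query restricts results to one domain and one file extension, which is why a scraper of this kind typically issues one search per file type.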

Note: Ensure that you have a Python environment set up on your Kali Linux system, as Pymeta is a Python-based tool. For the installation process, see: Python Installation Steps on Linux

Installation of Pymeta Tool on Kali Linux OS

Step 1: Use the following command to clone the tool's repository onto the Kali Linux system.

git clone https://github.com/m8r0wn/pymeta

Step 2: Before running the tool, navigate to the cloned directory. By default, git clone names the directory after the last part of the repository URL, so it is pymeta in lowercase.

cd pymeta

Step 3: Execute the command below to install the packages required to run the Pymeta tool.

sudo pip3 install -r requirements.txt

Step 4: Run setup.py with python3 to install the tool.

sudo python3 setup.py install

Step 5: Verify the installation with the command below, which displays the tool's help page and its available options.

pymeta -h
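If the help page does not appear, a quick check is whether the `pymeta` executable is on your PATH at all. The following is a small sketch using Python's standard `shutil.which`; the helper name `tool_available` is hypothetical.

```python
import shutil


def tool_available(name):
    """Return True if an executable called `name` is found on PATH."""
    return shutil.which(name) is not None


if __name__ == "__main__":
    if tool_available("pymeta"):
        print("pymeta found on PATH")
    else:
        print("pymeta not found; re-check the installation steps")
```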

Using Pymeta Tool on Kali Linux OS

Example 1: Examining the domain geeksforgeeks.org

pymeta -d geeksforgeeks.org

In this example, we scrape files from the target domain geeksforgeeks.org.

The tool extracted 53 files in various formats from the GeeksforGeeks server. These files may contain sensitive data.

The tool's output displays the links to the discovered files in docx, pdf, and other formats.

The screenshot below shows all the files scraped by the tool.
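To get a quick overview of what was downloaded, you can tally the files by extension. The sketch below assumes the files were saved under a local directory; the name `downloads` is a hypothetical placeholder, so substitute the directory the tool actually wrote to on your system.

```python
from collections import Counter
from pathlib import Path


def count_by_extension(directory):
    """Count regular files under `directory` (recursively) by lowercase extension."""
    counts = Counter()
    for path in Path(directory).rglob("*"):
        if path.is_file():
            # Files without an extension are grouped under "(none)".
            counts[path.suffix.lower() or "(none)"] += 1
    return counts


if __name__ == "__main__":
    # "downloads" is a hypothetical directory name, not a PyMeta default.
    for ext, total in count_by_extension("downloads").most_common():
        print(f"{ext}: {total}")
```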

Example 2: Setting up the maximum value 

python3 pymeta.py -d geeksforgeeks.org -s bing -m 20

In this example, we set a maximum value (-m 20) to limit the files downloaded from the target domain server.

The links to all the downloaded files are saved in the geek_meta2.csv file, so you can visit the links as required.
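As a rough sketch, the links can be pulled out of the report with Python's standard `csv` module. The column layout of the report is not described here, so the code makes no assumption about column names: it scans every cell and keeps those that look like http(s) URLs. The helper name `extract_links` is hypothetical.

```python
import csv
from pathlib import Path


def extract_links(csv_file):
    """Return every cell that looks like an http(s) URL, in file order."""
    links = []
    for row in csv.reader(csv_file):
        for cell in row:
            value = cell.strip()
            if value.startswith(("http://", "https://")):
                links.append(value)
    return links


if __name__ == "__main__":
    report = Path("geek_meta2.csv")  # report file name used in this example
    if report.exists():
        with report.open(newline="", encoding="utf-8") as handle:
            for link in extract_links(handle):
                print(link)
    else:
        print(f"{report} not found in the current directory")
```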
