# Data Overview

{% hint style="info" %}
You’re viewing a preview of our documentation. The full version, available to clients, provides in-depth data stats, onboarding resources, changelogs, FAQs, and additional details.

<a href="https://www.scrapin.io/book-a-call" class="button primary">Access full documentation</a>
{% endhint %}

Our dataset offers **comprehensive global coverage** of professionals and companies.\
It includes detailed records across multiple geographies, industries, and job levels.

**High-level statistics:**

* **Profiles:** \~**610 million** individual profiles
* **Companies:** \~**65 million** company records
* **Geographies covered:** 200+ countries & territories
* **Industries represented:** 50+ industries (IT, Finance, Healthcare, Manufacturing, etc.)

### Data Fields

| Category            | Data Type          | Description                                                                        |
| ------------------- | ------------------ | ---------------------------------------------------------------------------------- |
| **Person Profile**  | Identity           | Full name, social profile URL, and headline — core identifiers of the professional |
|                     | Current Position   | Job title, company name, domain, and industry of the current role                  |
|                     | Career History     | Past roles and companies, with dates when available                                |
|                     | Education          | Schools, degrees, fields of study, and graduation years                            |
|                     | Skills & Languages | List of declared skills and languages when available                               |
|                     | Network & Activity | Connection count ranges, recent activity signals (posts, likes, comments)          |
|                     | Media              | Profile picture URL if public                                                      |
| **Company Profile** | Company Identity   | Company name, LinkedIn URL, domain, and industry classification                    |
|                     | Size & Structure   | Company size ranges, number of employees on LinkedIn, headquarters location        |
|                     | Background         | Founding year and company specialties                                              |
|                     | Workforce Signals  | Employees present on LinkedIn, hiring activity, job postings                       |
|                     | Technology & Tools | Technologies used (when detectable)                                                |
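As a rough illustration, the categories in the table above could be modeled with the record types below. This is a minimal sketch, not the actual delivery schema: every class and attribute name here (`PersonProfile`, `CompanyProfile`, `Position`, `profile_url`, `size_range`, and so on) is a hypothetical choice made for this example, and only a subset of the listed fields is shown. Fields the table marks as "when available" or "if public" are modeled as optional.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Position:
    """One role, current or past. Dates are optional ("when available")."""
    title: str
    company_name: str
    company_domain: Optional[str] = None
    industry: Optional[str] = None
    start_date: Optional[str] = None
    end_date: Optional[str] = None


@dataclass
class PersonProfile:
    # Identity
    full_name: str
    profile_url: str
    headline: Optional[str] = None
    # Current position and career history
    current_position: Optional[Position] = None
    career_history: List[Position] = field(default_factory=list)
    # Skills and languages, present only when declared
    skills: List[str] = field(default_factory=list)
    languages: List[str] = field(default_factory=list)
    # Media
    picture_url: Optional[str] = None


@dataclass
class CompanyProfile:
    # Company identity
    name: str
    profile_url: str
    domain: Optional[str] = None
    industry: Optional[str] = None
    # Size, structure, and background
    size_range: Optional[str] = None
    headquarters: Optional[str] = None
    founded_year: Optional[int] = None
    specialties: List[str] = field(default_factory=list)
    # Technologies, present only when detectable
    technologies: List[str] = field(default_factory=list)


# Placeholder values only, not real records.
profile = PersonProfile(
    full_name="Jane Doe",
    profile_url="https://example.com/in/jane-doe",
    current_position=Position(title="Data Engineer", company_name="Example Corp"),
)
company = CompanyProfile(
    name="Example Corp",
    profile_url="https://example.com/company/example-corp",
)
```

Defaulting optional fields to `None` or an empty list keeps downstream code from failing on records where a field is simply not filled in.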

### Data Freshness

We ensure our database remains **up-to-date and reliable**.

* **Daily updates:** 5 to 20 million profiles and companies updated
* **Change tracking:** We monitor career moves, job changes, and company updates in real time

One of the key advantages of our database is that it is continuously refreshed thanks to our **live API infrastructure**. Unlike static databases that rely on monthly or quarterly updates, our system processes **millions of real-time requests every day**.

This means that whenever a profile or company detail changes — for example a new job title, a company switch, or a fresh job posting — our system is designed to capture and reflect those changes immediately.

Because of this constant stream of updates, the **majority of our database is fully refreshed every month**, ensuring that our clients always work with **up-to-date and reliable information**.

In short, our **live update pipeline** transforms our database into a living ecosystem, always aligned with the latest professional movements and company dynamics.

***

### Data Completeness

Not every field is filled in for every profile, so we track coverage to maintain transparency.\
The table below is an illustrative example:

| Field                   | Coverage Rate | Notes                                       |
| ----------------------- | ------------- | ------------------------------------------- |
| Full Name               | 100%          | Always available                            |
| Social Profile URL      | 100%          | Always available                            |
| Professional Experience | 75%           | Some profiles do not list experience        |
| Current Company         | 65%           | Some freelancers/consultants are not linked |
| Education               | 30%           | Not all users fill in education             |
| Skills                  | 9%            | Optional field                              |
| Email / Phone (if any)  | 10%           | Limited availability, not always public     |
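A coverage rate like those in the table can be checked on any sample you receive by counting how many records have a non-empty value for each field. The sketch below assumes records are plain dictionaries; the function name `coverage_rates` and the field keys are hypothetical and chosen only for this example.

```python
from typing import Any, Dict, Iterable, List


def coverage_rates(records: Iterable[Dict[str, Any]], fields: List[str]) -> Dict[str, float]:
    """Return, for each field, the percentage of records with a non-empty value."""
    rows = list(records)
    if not rows:
        return {name: 0.0 for name in fields}
    rates: Dict[str, float] = {}
    for name in fields:
        # A missing key, None, an empty string, or an empty container counts as "not filled".
        filled = sum(1 for row in rows if row.get(name) not in (None, "", [], {}))
        rates[name] = 100.0 * filled / len(rows)
    return rates


# Placeholder sample, not real data.
sample = [
    {"full_name": "A", "skills": ["python"], "education": None},
    {"full_name": "B", "skills": [], "education": "Example University"},
    {"full_name": "C", "skills": [], "education": None},
    {"full_name": "D"},
]
rates = coverage_rates(sample, ["full_name", "skills", "education"])
print(rates)
```

Treating empty lists and empty strings as missing matters for optional fields such as skills, where a field may be present in the record but carry no values.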

