Firmograph API
API documentation
company
list
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
limit | Number of results to return per page. |
offset | The initial index from which to return the results. |
address | Address of company to search for (whole address string) |
name | Name of company to search for |
read
Path Parameters
The following parameters should be included in the URL path.
Parameter | Description |
---|---|
id required | A unique integer value identifying this Company. |
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
address | Address of company to search for (whole address string) |
name | Name of company to search for |
domain
list
List domains
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
limit | Number of results to return per page. |
offset | The initial index from which to return the results. |
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
create
Create domain
Request Body
The request body should be a "application/json"
encoded object, containing the following items.
Parameter | Description |
---|---|
value required | Domain name |
source required | (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
httperr | (`none`, `timeout`, `dns_lookup`, `connection_lost`, `connection_refused`, `response_never_received`, `ignore_request`, `connection_done`, `accepted`, `no_data`, `bad_request`, `unauthorized`, `forbidden`, `not_found`, `method_not_allowed`, `server_error`, `bad_gateway`, `service_unavailable`, `gateway_timeout`, `other`) |
crawl_status | (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | (`none`, `paired`, `non_paired`) |
protocol | (`unknown`, `https`, `http`) |
jobs | (`no_career_links`, `no_vacancies`, `vacancies`, `unknown`) |
extra_contact_urls | Extra URLs to scrape from |
country | |
date_checked | |
www_prefix | Default adress is www.domain.tld |
date_jobs_checked | |
confirmed_values | |
technologies | |
ip | |
pages_to_download | Max pages to download during full crawl. If NONE is set `VISIT_PAGES_PER_DOMAIN` scrapy setting value is used |
path_sitemap |
read
Retrieve data about domain
Path Parameters
The following parameters should be included in the URL path.
Parameter | Description |
---|---|
id required |
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
update
Update domain
Path Parameters
The following parameters should be included in the URL path.
Parameter | Description |
---|---|
id required |
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
Request Body
The request body should be a "application/json"
encoded object, containing the following items.
Parameter | Description |
---|---|
value required | Domain name |
source required | (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
httperr | (`none`, `timeout`, `dns_lookup`, `connection_lost`, `connection_refused`, `response_never_received`, `ignore_request`, `connection_done`, `accepted`, `no_data`, `bad_request`, `unauthorized`, `forbidden`, `not_found`, `method_not_allowed`, `server_error`, `bad_gateway`, `service_unavailable`, `gateway_timeout`, `other`) |
crawl_status | (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | (`none`, `paired`, `non_paired`) |
protocol | (`unknown`, `https`, `http`) |
jobs | (`no_career_links`, `no_vacancies`, `vacancies`, `unknown`) |
extra_contact_urls | Extra URLs to scrape from |
country | |
date_checked | |
www_prefix | Default adress is www.domain.tld |
date_jobs_checked | |
confirmed_values | |
technologies | |
ip | |
pages_to_download | Max pages to download during full crawl. If NONE is set `VISIT_PAGES_PER_DOMAIN` scrapy setting value is used |
path_sitemap |
partial_update
Partially update domain
Path Parameters
The following parameters should be included in the URL path.
Parameter | Description |
---|---|
id required |
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
Request Body
The request body should be a "application/json"
encoded object, containing the following items.
Parameter | Description |
---|---|
value | Domain name |
source | (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
httperr | (`none`, `timeout`, `dns_lookup`, `connection_lost`, `connection_refused`, `response_never_received`, `ignore_request`, `connection_done`, `accepted`, `no_data`, `bad_request`, `unauthorized`, `forbidden`, `not_found`, `method_not_allowed`, `server_error`, `bad_gateway`, `service_unavailable`, `gateway_timeout`, `other`) |
crawl_status | (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | (`none`, `paired`, `non_paired`) |
protocol | (`unknown`, `https`, `http`) |
jobs | (`no_career_links`, `no_vacancies`, `vacancies`, `unknown`) |
extra_contact_urls | Extra URLs to scrape from |
country | |
date_checked | |
www_prefix | Default adress is www.domain.tld |
date_jobs_checked | |
confirmed_values | |
technologies | |
ip | |
pages_to_download | Max pages to download during full crawl. If NONE is set `VISIT_PAGES_PER_DOMAIN` scrapy setting value is used |
path_sitemap |
domain_content
list
List text content of domains
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
limit | Number of results to return per page. |
offset | The initial index from which to return the results. |
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
read
Retrieve text content of domain
Path Parameters
The following parameters should be included in the URL path.
Parameter | Description |
---|---|
id required | A unique integer value identifying this Domain. |
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
country | Domain country |
exports
for_crawler
cursor > list
list domains (only fields necessary for crawler are returned)
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
cursor | The pagination cursor value. |
limit | Number of results to return per page. |
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
has_emails | Only domains with emails (`has_emails=true`), or without emails (`has_emails=false`). Leave out this parameter to include both. |
has_phones | Only domains with phones (`has_phones=true`), or without phones (`has_phones=false`). Leave out this parameter to include both. |
httperr | HTTP error (`none`, `timeout`, `dns_lookup`, `connection_lost`, `connection_refused`, `response_never_received`, `ignore_request`, `connection_done`, `accepted`, `no_data`, `bad_request`, `unauthorized`, `forbidden`, `not_found`, `method_not_allowed`, `server_error`, `bad_gateway`, `service_unavailable`, `gateway_timeout`, `other`) |
contact_urls | Whether to include contact links in the response |
is_subdomain | Set to `true` to select only domains with `is_subdomain=True`. |
country | Domain country |
domain > list
list domains (only fields necessary for crawler are returned)
Query Parameters
The following parameters should be included as part of a URL query string.
Parameter | Description |
---|---|
cursor | The pagination cursor value. |
limit | Number of results to return per page. |
date_checked | Datetime of the last time domain was crawled. Use `date_checked_after='...'` and/or `date_checked_before='...'` |
date_created | Datetime of the domain insertion into DB. Use `date_created_after='...'` and/or `date_created_before='...'` |
source | Source of domain (`user_search`, `linkedin`, `test`, `redirect`, `merk`, `usa_b2b`, `ssl_cert`, `sk`) |
crawl_status | Crawl status (`none`, `unknown`, `success`, `no_contacts`, `http_error`, `parking`, `redirect`, `black_redir`, `robots_txt_disallow_all`, `cannot_be_displayed_err`, `default_page`, `no_contacts_need_js`) |
merk_status | MERK status (`none`, `paired`, `non_paired`) |
value | Domain name |
ordering | Ordering of results (for multiple ordering use values separated by a coma) |
has_emails | Only domains with emails (`has_emails=true`), or without emails (`has_emails=false`). Leave out this parameter to include both. |
has_phones | Only domains with phones (`has_phones=true`), or without phones (`has_phones=false`). Leave out this parameter to include both. |
httperr | HTTP error (`none`, `timeout`, `dns_lookup`, `connection_lost`, `connection_refused`, `response_never_received`, `ignore_request`, `connection_done`, `accepted`, `no_data`, `bad_request`, `unauthorized`, `forbidden`, `not_found`, `method_not_allowed`, `server_error`, `bad_gateway`, `service_unavailable`, `gateway_timeout`, `other`) |
contact_urls | Whether to include contact links in the response |
is_subdomain | Set to `true` to select only domains with `is_subdomain=True`. |
country | Domain country |