Matt Joyce
7d2efe5acb
Added request timeouts
...
connection timeout to 5 seconds and the response timeout to 10
PyLint W3101
2024-05-23 08:39:19 +10:00
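The timeout change in the commit above can be sketched as follows. This is a minimal illustration, assuming the SDK makes its HTTP calls through the `requests` library, whose `timeout` argument accepts a `(connect, read)` tuple. The names `DEFAULT_TIMEOUT` and `with_timeout` are hypothetical and do not come from the commit. Pylint's W3101 (missing-timeout) is the check that flags `requests` calls made without a timeout.

```python
# Hypothetical helper: names are illustrative, not taken from the SDK.
CONNECT_TIMEOUT_S = 5   # seconds allowed to establish the connection
READ_TIMEOUT_S = 10     # seconds allowed to wait for the response
DEFAULT_TIMEOUT = (CONNECT_TIMEOUT_S, READ_TIMEOUT_S)


def with_timeout(kwargs=None):
    """Return a copy of kwargs with a default timeout unless one is set."""
    merged = dict(kwargs or {})
    merged.setdefault("timeout", DEFAULT_TIMEOUT)
    return merged


# Assumed usage with requests (not executed here):
#   requests.post(url, json=payload, **with_timeout({"headers": headers}))
```

Keeping the default in one place means a caller can still pass its own `timeout` for a slow endpoint without the helper overriding it.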
Matt Joyce
96b19172a1
Removed trailing whitespace
...
PyLint C0303: Trailing whitespace (trailing-whitespace)
2024-05-23 08:30:23 +10:00
Matt Joyce
6216c85322
Time module already imported
...
Pylint
W0404: Reimport 'time' (imported line 16) (reimported)
C0415: Import outside toplevel (time) (import-outside-toplevel)
2024-05-23 08:21:32 +10:00
Matt Joyce
8adf2b7132
Added Docstrings for functions
...
PyLint C0116: Missing function or method docstring (missing-function-docstring)
2024-05-23 08:20:32 +10:00
Matt Joyce
971e1f85c4
Added module docstring
...
PyLint C0114 - missing-module-docstring
2024-05-23 08:03:58 +10:00
Matt Joyce
8d041c05b4
rearranged logic for FIRECRAWL_API_URL
...
Previously, the environment variable was used only when the parameter was explicitly set to None, which was counter-intuitive.
2024-05-23 08:00:56 +10:00
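One way to read the rearranged lookup order is sketched below. It assumes a precedence of explicit argument, then the `FIRECRAWL_API_URL` environment variable, then a built-in default. The function name `resolve_api_url`, the `env` parameter, and the default URL are assumptions made for illustration; the commit itself does not show the code.

```python
import os

# Assumption: default hosted endpoint, used only when nothing else is provided.
DEFAULT_API_URL = "https://api.firecrawl.dev"


def resolve_api_url(api_url=None, env=None):
    """Pick the API URL: explicit argument, then environment, then default."""
    env = os.environ if env is None else env
    if api_url:
        return api_url
    return env.get("FIRECRAWL_API_URL", DEFAULT_API_URL)
```

With this ordering, a local install can be targeted either by passing `api_url` directly or by exporting `FIRECRAWL_API_URL`, and the argument wins when both are present.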
rafaelsideguide
aa6df4305e
crawl load tests 6 and 7
2024-05-22 18:20:24 -03:00
Nicolas
4e39701644
Update main.py
2024-05-22 12:59:56 -07:00
rafaelsideguide
73f1d09d39
Update website_params.ts
2024-05-22 15:07:12 -03:00
Nicolas
3aa5f26627
Update main.py
2024-05-22 10:45:43 -07:00
Nicolas
3e63985e53
Update main.py
2024-05-22 10:40:47 -07:00
rafaelsideguide
4dfc371241
Update index.test.ts
2024-05-22 14:38:41 -03:00
rafaelsideguide
f4a3469b9e
Merge branch 'main' into bug/crawl-limit
2024-05-22 14:27:28 -03:00
rafaelsideguide
ff147f1f51
load testing for crawl
2024-05-22 14:26:29 -03:00
Nicolas
0d187f0425
Merge pull request #77 from tractorjuice/patch-1
...
Add additional file extensions to crawler.ts
2024-05-22 10:16:49 -07:00
rafaelsideguide
04a0bef0fb
Merge branch 'main' into test/load-testing
2024-05-22 11:26:19 -03:00
rafaelsideguide
e4573c08ca
Update website_params.ts
2024-05-22 11:24:48 -03:00
rafaelsideguide
f9ae1729b6
Update firecrawl.py
2024-05-22 09:40:38 -03:00
rafaelsideguide
068a240ab4
load tests for scrape route
2024-05-22 09:30:32 -03:00
Nicolas
cb2bd0e71f
Update index.test.ts
2024-05-21 19:03:32 -07:00
Nicolas
253abb849f
Update rate-limiter.ts
2024-05-21 18:53:58 -07:00
Nicolas
229b9908d2
Nick: only enable hyper dx in prod
2024-05-21 18:52:46 -07:00
Nicolas
a8ff295977
Update single_url.ts
2024-05-21 18:50:42 -07:00
Nicolas
a5e718b084
Nick: improvements
2024-05-21 18:34:23 -07:00
Nicolas
6285f12cd1
Merge pull request #167 from mendableai/nsc/hyper-dx-integration
...
feat: HyperDX Integration
2024-05-21 13:19:38 -07:00
rafaelsideguide
75f4e34d8e
Merge branch 'main' into test/load-testing
2024-05-21 10:28:02 -03:00
rafaelsideguide
ec46065066
Update rate-limiter.ts
2024-05-21 10:07:27 -03:00
rafaelsideguide
6a3ac13fe1
Update load-test.yml
2024-05-21 10:06:02 -03:00
youqiang
c47dae13a9
update: wait until body attached in playwright-service
2024-05-21 14:53:57 +08:00
Nicolas
7f64fe884a
Update blocklist.ts
2024-05-20 17:26:01 -07:00
Nicolas
756f54466d
Nick: allowed keywords for now
2024-05-20 17:24:21 -07:00
Nicolas
01783dc336
Update openapi.json
2024-05-20 17:10:55 -07:00
Nicolas
77a79b5a79
Nick: max num tokens for llm extract (for now) + slice the max
2024-05-20 17:07:38 -07:00
Nicolas
2644e1c029
Update .env.example
2024-05-20 13:36:51 -07:00
Nicolas
9e61d431f0
Nick: hyper dx integration init
2024-05-20 13:36:34 -07:00
Nicolas
d5d0d48848
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-20 10:06:52 -07:00
Nicolas
60002e79b8
Nick: python sdk bump
2024-05-20 10:06:48 -07:00
Matt Joyce
7e5ef4dec4
Allow override of API URL
...
Allows python sdk to be used with local installs.
2024-05-20 18:46:32 +10:00
Nicolas
c74f757b53
Update rate-limiter.ts
2024-05-19 13:05:36 -07:00
Nicolas
98a39b39ab
Nick: increased rate limits
2024-05-19 12:59:29 -07:00
Nicolas
18fa15df25
Update index.test.ts
2024-05-19 12:50:06 -07:00
Nicolas
614c073af0
Nick: improvements
2024-05-19 12:45:46 -07:00
Nicolas
f473793ba3
Merge branch 'main' into feat/rate-limits
2024-05-19 12:23:34 -07:00
Nicolas
4efebf7a4b
Merge branch 'test/load-testing' of https://github.com/mendableai/firecrawl into test/load-testing
2024-05-19 12:22:51 -07:00
Nicolas
5792cd022c
Update fly.staging.toml
2024-05-19 12:22:49 -07:00
rafaelsideguide
d667e1417b
added fly staging load test
...
- being rate limited. Need to add the token to the rate-limit functions
2024-05-17 19:09:19 -03:00
Nicolas
7630565c26
Create fly.staging.toml
2024-05-17 14:33:59 -07:00
rafaelsideguide
7297b21dcd
Added load testing using artillery
2024-05-17 18:32:44 -03:00
rafaelsideguide
a480595aa7
Update index.test.ts
2024-05-17 15:41:27 -03:00
rafaelsideguide
54049be539
Added e2e tests
2024-05-17 15:37:47 -03:00
Nicolas
6feb21cc35
Update website_params.ts
2024-05-17 11:21:26 -07:00
Nicolas
5be208f595
Nick: fixed
2024-05-17 10:40:44 -07:00
Nicolas
eb88447e8b
Update index.test.ts
2024-05-17 10:00:05 -07:00
Nicolas
df6c3d1e7d
Merge branch 'main' into detect-pdfs
2024-05-17 09:55:51 -07:00
Nicolas
9d635cb2a3
Nick: docx support
2024-05-16 11:48:02 -07:00
Nicolas
bcce0544e7
Update openapi.json
2024-05-16 11:03:32 -07:00
Nicolas
80250fb54f
Update index.test.ts
2024-05-15 17:40:46 -07:00
Nicolas
098db17913
Update index.ts
2024-05-15 17:37:09 -07:00
Nicolas
93b1f0334e
Update index.test.ts
2024-05-15 17:35:06 -07:00
Nicolas
123fb784ca
Update index.test.ts
2024-05-15 17:29:22 -07:00
Nicolas
4a6cfb6097
Update index.test.ts
2024-05-15 17:22:29 -07:00
Nicolas
6ca368327f
Merge branch 'main' into test/crawl-options
2024-05-15 17:18:25 -07:00
Nicolas
24be4866c5
Nick:
2024-05-15 17:16:20 -07:00
Nicolas
ade4e05cff
Nick: working
2024-05-15 17:13:04 -07:00
Nicolas
bfccaf670d
Nick: fixes most of it
2024-05-15 15:30:37 -07:00
rafaelsideguide
d91043376c
not working yet
2024-05-15 18:54:40 -03:00
rafaelsideguide
fa014defc7
Fixing child links only bug
2024-05-15 18:35:09 -03:00
Nicolas
2ba743fb1a
Merge pull request #27 from eltociear/patch-1
...
refactor: fix typo in WebScraper/index.ts
2024-05-15 13:28:38 -07:00
Nicolas
0663d78324
Merge pull request #119 from chand1012/main
...
Add Docker Compose for easy self hosting
2024-05-15 13:27:40 -07:00
rafaelsideguide
da8d94105d
fixed for testing the crawl algorithm only
2024-05-15 17:16:03 -03:00
Nicolas
95ffaa2236
Update crawl.test.ts
2024-05-15 12:58:02 -07:00
Nicolas
f15b8f855e
Update crawl.json
2024-05-15 12:57:24 -07:00
Nicolas
98dd672d0a
Update crawl.json
2024-05-15 12:55:04 -07:00
Nicolas
499671c87f
Update crawl.test.ts
2024-05-15 12:50:13 -07:00
Nicolas
58053eb423
Update rate-limiter.ts
2024-05-15 12:47:35 -07:00
Nicolas
4745d114be
Update crawl.test.ts
2024-05-15 12:42:14 -07:00
Nicolas
1601e93d69
Merge branch 'main' into test/crawl-options
2024-05-15 12:34:47 -07:00
Nicolas
3678d3c986
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-15 12:11:18 -07:00
Nicolas
fd82982a31
Nick:
2024-05-15 12:11:16 -07:00
rafaelsideguide
4925ee59f6
added crawl test suite
2024-05-15 15:50:50 -03:00
Nicolas
1b0d6341d3
Update index.ts
2024-05-15 11:48:12 -07:00
Nicolas
d10f81e7fe
Nick: fixes
2024-05-15 11:28:20 -07:00
Nicolas
87570bdfa1
Update index.ts
2024-05-15 11:06:03 -07:00
rafaelsideguide
d4574851be
Added rpc definition
2024-05-15 08:40:21 -03:00
rafaelsideguide
47c20c80ab
Update auth.ts
2024-05-15 08:34:49 -03:00
Ikko Eltociear Ashimine
e91c122c69
Merge branch 'main' into patch-1
2024-05-15 12:14:52 +09:00
Nicolas
7d8ceab6de
Merge branch 'feat/rate-limits' of https://github.com/mendableai/firecrawl into feat/rate-limits
2024-05-14 14:48:01 -07:00
Nicolas
0e0faa28b3
Update auth.ts
2024-05-14 14:47:36 -07:00
rafaelsideguide
672eddb999
updated rpc
2024-05-14 18:47:21 -03:00
Nicolas
4761ea510b
Update rate-limiter.ts
2024-05-14 14:26:42 -07:00
rafaelsideguide
40ad97dee8
added rate limits
2024-05-14 18:08:31 -03:00
Nicolas
27e1e22a0a
Update index.test.ts
2024-05-14 12:28:25 -07:00
Nicolas
a0fdc6f7c6
Nick:
2024-05-14 12:12:40 -07:00
Nicolas
7f31959be7
Nick:
2024-05-14 12:04:36 -07:00
Nicolas
8a72cf556b
Nick:
2024-05-13 21:10:58 -07:00
Nicolas
26a092f780
Update index.ts
2024-05-13 21:04:49 -07:00
Nicolas
8101cbee37
Update index.ts
2024-05-13 21:02:47 -07:00
Nicolas
86b8439844
Nick:
2024-05-13 20:51:42 -07:00
Nicolas
a96fc5b96d
Nick: 4x speed
2024-05-13 20:45:11 -07:00
Nicolas
e26008a833
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-13 19:54:13 -07:00
Nicolas
512449e1aa
Nick: v21
2024-05-13 19:54:12 -07:00
Nicolas
bd27b0e17e
Merge pull request #142 from mendableai/doc/crawl-limit-default
...
[Doc] Added default value for crawlOptions.limit
2024-05-13 18:38:09 -07:00
Nicolas
aa0c8188c9
Nick: 408 handling
2024-05-13 18:34:00 -07:00
Nicolas
999176d576
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-13 13:57:34 -07:00
Nicolas
f3ec21d9c4
Update runWebScraper.ts
2024-05-13 13:57:22 -07:00
Nicolas
65d89afba9
Nick:
2024-05-13 13:01:43 -07:00
Eric Ciarla
4cc46d4af8
Update models.ts
2024-05-13 15:23:31 -04:00
rafaelsideguide
8eb2e95f19
Cleaned up
2024-05-13 16:13:10 -03:00
Nicolas
2ce045912f
Nick: disable vision right now
2024-05-13 10:56:08 -07:00
rafaelsideguide
f4348024c6
Added check during scraping to deal with pdfs
...
Checks if the URL is a PDF during the scraping process (single_url.ts).
TODO: Run integration tests. Does this strategy affect the running time?
P.S. Some comments need to be removed if we decide to proceed with this strategy.
2024-05-13 09:13:42 -03:00
Rafael Miller
5a2712fa5a
Merge branch 'main' into detect-pdfs
2024-05-10 15:53:13 -03:00
rafaelsideguide
bc6b929b43
[Bug] Fixing /crawl limit
2024-05-10 12:15:54 -03:00
rafaelsideguide
df16890f84
Added default value for crawlOptions.limit
2024-05-10 11:59:33 -03:00
rafaelsideguide
18480b2005
Removed .env.example, improved docs and docker compose envs
2024-05-10 11:38:17 -03:00
Nicolas
66bd1e4020
Update website_params.ts
2024-05-09 18:41:15 -07:00
Nicolas
c02a82c282
Update main.py
2024-05-09 18:02:34 -07:00
Nicolas
efc6fcb474
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-09 18:01:04 -07:00
Nicolas
73687822ad
Update main.py
2024-05-09 18:00:58 -07:00
Nicolas
d21091bb06
Update single_url.ts
2024-05-09 17:52:46 -07:00
Nicolas
be85008622
Nick: better
2024-05-09 17:48:11 -07:00
Nicolas
be5661a768
Nick: a lot better
2024-05-09 17:45:16 -07:00
Nicolas
fce17e6beb
Update credit_billing.ts
2024-05-09 15:29:58 -07:00
rafaelsideguide
f4d8b2c89a
Updated docs
2024-05-09 10:36:56 -03:00
Nicolas
aa6b84c5fa
Nick: readme
2024-05-08 17:41:15 -07:00
Nicolas
d9da4b53f8
Update example.py
2024-05-08 17:36:40 -07:00
Nicolas
4c88d5da66
Nick: v8 python
2024-05-08 17:35:16 -07:00
Nicolas
e6dbbf1bab
Nick: fixes js and pydantic implementation
2024-05-08 17:16:59 -07:00
Nicolas
c89964b230
Nick:
2024-05-08 16:38:49 -07:00
Nicolas
9541ff6b30
Nick: 429 addressed
2024-05-08 15:14:39 -07:00
Nicolas
3bfef646e0
Update index.test.ts
2024-05-08 13:23:53 -07:00
Nicolas
6ced8e73a7
Update index.test.ts
2024-05-08 13:13:38 -07:00
Nicolas
c50076c377
Update websites.json
2024-05-08 13:04:17 -07:00
Nicolas
1296928879
Update index.test.ts
2024-05-08 13:00:20 -07:00
Nicolas
4a5f87623c
Merge pull request #118 from mendableai/feat/test-suite
...
[Test] Added integration tests suite
2024-05-08 12:47:17 -07:00
Nicolas
fb7a8fd73f
Delete test_screenshot.png
2024-05-08 12:39:32 -07:00
Nicolas
c635688ddb
Nick: test suite
2024-05-08 12:36:54 -07:00
Nicolas
d34b4de6ac
Update websites.json
2024-05-08 12:27:45 -07:00
Nicolas
a0a67f124a
Update index.test.ts
2024-05-08 12:26:04 -07:00
Nicolas
b7e3104c7b
Ni
2024-05-08 12:18:53 -07:00
Nicolas
ad58bc2820
Nick: test suite init
2024-05-08 11:38:46 -07:00
rafaelsideguide
3f460af6c5
Added idempotency key to crawl route
2024-05-07 15:29:27 -03:00
Eric Ciarla
d280bcadf3
Add keyAuth
2024-05-07 13:52:42 -04:00
Nicolas
056b0ec24d
Merge branch 'main' into feat/test-suite
2024-05-07 10:41:09 -07:00
Nicolas
dcedb8d798
Merge branch 'main' into feat/max-depth
2024-05-07 10:20:49 -07:00
Nicolas
6505bf6bf2
Merge branch 'main' into feat/max-depth
2024-05-07 10:20:44 -07:00
Nicolas
bdbee963f7
Merge branch 'main' into nsc/cancel-job
2024-05-07 10:13:43 -07:00
rafaelsideguide
61d615c04b
Added tests
2024-05-07 14:03:00 -03:00
rafaelsideguide
e1f52c538f
nested includeHtml inside pageOptions
2024-05-07 13:40:24 -03:00
Nicolas
f46bf19fa5
Nick:
2024-05-07 09:26:52 -07:00
rafaelsideguide
83f3408634
Added max depth option
2024-05-07 11:06:26 -03:00
Nicolas
2e3ff85509
Update crawl-cancel.ts
2024-05-06 17:22:16 -07:00
Nicolas
6d5da358cc
Nick: cancel job
2024-05-06 17:16:43 -07:00
rafaelsideguide
509250c4ef
changed to includeHtml
2024-05-06 19:45:56 -03:00
rafaelsideguide
538355f1af
Added toMarkdown option
2024-05-06 11:36:44 -03:00
Nicolas
d1b6f6dcde
Update fly.toml
2024-05-04 13:49:09 -07:00
Nicolas
cd9a0840b5
Update search.ts
2024-05-04 13:13:15 -07:00
Nicolas
5229a4902b
Update search.ts
2024-05-04 13:09:11 -07:00
Nicolas
ce7bab7b35
Update status.ts
2024-05-04 13:00:38 -07:00
Nicolas
15b774e974
Update index.ts
2024-05-04 12:44:30 -07:00
Nicolas
67f135a5b6
Update crawl-status.ts
2024-05-04 12:31:28 -07:00
Nicolas
2aa09a3000
Nick: partial docs working, cleaner
2024-05-04 12:30:12 -07:00
Nicolas
00373228fa
Update index.ts
2024-05-04 11:53:16 -07:00
rafaelsideguide
fbb4c63a1a
[Test] Added integration tests suite
...
solves #15
2024-05-03 17:23:25 -03:00
Nicolas
21cdaf5996
Update log_job.ts
2024-05-02 12:40:49 -07:00
Eric Ciarla
caf3f9eede
Add Posthog Logging
2024-05-02 15:30:22 -04:00
Nicolas
8a95cb42f0
Update models.ts
2024-04-30 18:36:21 -07:00
Nicolas
4967536501
Update index.ts
2024-04-30 18:19:55 -07:00
Nicolas
768166b066
Update single_url.ts
2024-04-30 16:57:44 -07:00
Nicolas
a386259511
Update scrape.ts
2024-04-30 16:35:44 -07:00
Nicolas
dfcf39f4c0
Update scrape.ts
2024-04-30 16:19:59 -07:00
Nicolas
3c7030dbb1
Nick: improvements
2024-04-30 16:19:32 -07:00
Nicolas
cbd9e88b77
Merge branch 'main' into llm-extraction
2024-04-30 14:49:20 -07:00
Nicolas
4f526cff92
Nick: cleanup
2024-04-30 12:19:43 -07:00
Caleb Peffer
d9d206aff6
Caleb:
2024-04-30 10:27:39 -07:00
Caleb Peffer
d1235a0029
Caleb: switched back to markdown for extraction
2024-04-30 10:23:12 -07:00
Caleb Peffer
ad9c8e77d1
Caleb: commented out massive test
2024-04-30 10:22:09 -07:00
Caleb Peffer
a32f2b37b6
Caleb: logs work
2024-04-30 10:21:41 -07:00
Caleb Peffer
3ca9e5153f
Caleb: trying to get logging working
2024-04-30 09:20:15 -07:00
rafaelsideguide
a095e1b63d
Resolve merge conflicts with main
2024-04-30 10:54:18 -03:00
rafaelsideguide
35480bd2ad
Update index.test.ts
2024-04-30 10:40:32 -03:00
rafaelsideguide
d3c36adaa7
Update index.ts
2024-04-29 17:58:47 -03:00
Caleb Peffer
79cd7d2ebc
Merge branch 'llm-extraction' of https://github.com/mendableai/firecrawl into llm-extraction
2024-04-29 12:12:58 -07:00
Caleb Peffer
4f7737c922
Caleb: added ajv json schema validation.
2024-04-29 12:12:55 -07:00
rafaelsideguide
f8b207793f
changed the request to do a HEAD to check for a PDF instead
2024-04-29 15:15:32 -03:00
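The HEAD-based PDF check named in the commit above could look roughly like this. The project's scraper is TypeScript; the sketch uses Python only to illustrate the idea, and the function name `is_pdf_response` is hypothetical. It assumes the caller has already issued a HEAD request and passes in the response headers as a mapping.

```python
def is_pdf_response(headers):
    """Return True when HEAD response headers declare a PDF body."""
    # Header names are case-insensitive, so normalise before lookup.
    lowered = {str(k).lower(): str(v) for k, v in headers.items()}
    content_type = lowered.get("content-type", "")
    # Drop parameters such as "; charset=binary" before comparing.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type == "application/pdf"
```

A HEAD request avoids downloading the document body just to learn its type, at the cost of one extra round trip per URL.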
Nicolas
b69feab916
Merge branch 'main' into llm-extraction
2024-04-29 08:40:44 -07:00
Rafael Miller
71bdbf9f15
Merge pull request #67 from mendableai/feat/python-sdk-502
...
[Feat] Implemented retry attempts to handle 502 errors
2024-04-29 08:38:19 -03:00
Caleb Peffer
667f740315
Caleb: converted llm response to json
2024-04-28 19:28:28 -07:00
Caleb Peffer
2ad7a58eb7
Caleb: first test passing
2024-04-28 17:38:20 -07:00
Caleb Peffer
06497729e2
Caleb: got it to a testable state I believe
2024-04-28 15:52:09 -07:00
Caleb Peffer
6ee1f2d3bc
Caleb: initially pulled inspiration code from https://github.com/mishushakov/llm-scraper
2024-04-28 13:59:35 -07:00
Nicolas
68838c9e0d
Update single_url.ts
2024-04-28 12:44:00 -07:00
Nicolas
d8ee4e90d6
Update website_params.ts
2024-04-28 11:47:25 -07:00
Nicolas
8e44696c4d
Nick:
2024-04-28 11:34:25 -07:00
Nicolas
1dc6458c6a
Update crawler.ts
2024-04-27 11:17:10 -07:00
Nicolas
0f694e0608
Update crawler.ts
2024-04-27 11:14:52 -07:00
tractorjuice
a5d38039f2
Add additional file extensions to crawler.ts
...
Add additional file extensions.
2024-04-27 11:03:27 +01:00
Nicolas
7689c31d35
Update credit_billing.ts
2024-04-26 14:36:19 -07:00
Nicolas
0a607b9efa
Merge branch 'main' into feat/coupons
2024-04-26 14:23:35 -07:00
Nicolas
fdf913e0f1
Update index.test.ts
2024-04-26 13:06:48 -07:00
Nicolas
8e32453424
Update auth.ts
2024-04-26 12:57:49 -07:00
rafaelsideguide
1f48998970
done
2024-04-26 16:27:31 -03:00
Nicolas
bb3da8df89
Update package.json
2024-04-26 11:28:31 -07:00
Nicolas
df96fade42
Merge branch 'main' into pr/68
2024-04-26 11:28:28 -07:00
Nicolas
d210a57a9b
Update credit_billing.ts
2024-04-26 10:24:36 -07:00
Nicolas
24e1bdec1b
Update credit_billing.ts
2024-04-26 10:14:29 -07:00
rafaelsideguide
06675d1fe3
almost finished
2024-04-26 11:42:49 -03:00
Nicolas
f368e94cee
Merge branch 'main' into mdp/dotenv_jest
2024-04-25 17:59:20 -07:00
Nicolas
3ac8724329
Update openapi.json
2024-04-25 13:28:07 -07:00
Nicolas
b7c7291b0e
Nick: v15
2024-04-25 12:49:10 -07:00
Nicolas
a32e16a9be
Nick: added /search to the python sdk
2024-04-25 11:20:35 -07:00
rafaelsideguide
6ea818fac8
Update version
2024-04-25 14:49:12 -03:00
Nicolas
f2af7408e8
Update main.py
2024-04-25 10:31:28 -07:00
Nicolas
a3911bfc67
Update index.ts
2024-04-25 10:00:35 -07:00
Rafael Miller
03d1c64ac8
Removed process.env call for API_KEY
2024-04-25 13:33:06 -03:00
Mark Percival
e8b8150b56
Chore: Add some basic jest tests
2024-04-25 14:30:33 +00:00
Mark Percival
a7be09e479
Fix: Remove dotenv from npm module
2024-04-25 14:16:14 +00:00
rafaelsideguide
d3ab2ea926
[Feat] Implemented retry attempts to handle 502 errors
2024-04-25 10:51:01 -03:00
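The retry behaviour named in this commit can be sketched as below. This is a minimal illustration under stated assumptions: the retry count, the exponential backoff schedule, and the names `request_with_retries`, `send`, and `backoff_factor` are all hypothetical, since the log does not show how the SDK implements it.

```python
import time


def request_with_retries(send, retries=3, backoff_factor=0.5, sleep=time.sleep):
    """Call send() until it returns a status other than 502 or retries run out.

    send is any zero-argument callable returning an object with a
    status_code attribute (for example a requests.Response).
    """
    response = send()
    for attempt in range(retries):
        if response.status_code != 502:
            break
        # Exponential backoff: 0.5s, 1s, 2s, ... between attempts.
        sleep(backoff_factor * (2 ** attempt))
        response = send()
    return response
```

Passing `sleep` as a parameter keeps the helper testable without real delays. The last response is returned even when it is still a 502, so the caller decides how to report the failure.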
rafaelsideguide
9c481e5e83
[Feat] Coupon system
...
WIP. Idea for solving #57
2024-04-25 10:05:53 -03:00
rafaelsideguide
75597f72a1
[Feat] Added allowed urls
...
FireCrawl should be able to scrape LinkedIn Articles (/pulse/*)
2024-04-25 08:39:45 -03:00
Nicolas
a59ddf1855
Nick: default to serper
2024-04-24 18:00:25 -07:00
Roger M
f2690f6909
Support for tbs, filter, lang, country and location with Serper search.
2024-04-25 01:35:17 +01:00
Nicolas
e7d385ad32
Update search.ts
2024-04-24 10:23:26 -07:00
Nicolas
877af4231b
Update openapi.json
2024-04-24 10:11:44 -07:00
Nicolas
307ea6f5ec
Nick: improvements to search
2024-04-24 10:11:01 -07:00
Rafael Miller
f189589da4
Merge pull request #34 from mendableai/nsc/returnOnlyUrls
...
Implements the ability for the crawler to output all the links it found, without scraping
2024-04-24 10:34:42 -03:00
rafaelsideguide
07e93ee5fd
Update requests.http
2024-04-24 10:32:35 -03:00
rafaelsideguide
942ac3b41c
Resolved merge conflicts between feat/added-anthropic-vision-api and main
2024-04-24 09:57:45 -03:00
Nicolas
3b5b868d0d
Update requests.http
2024-04-23 18:13:58 -07:00
Nicolas
8939ca570b
Merge branch 'main' into nsc/returnOnlyUrls
2024-04-23 18:05:48 -07:00
Nicolas
479fa2f7f8
Nick:
2024-04-23 17:46:32 -07:00
Nicolas
fdb2789eaa
Nick: added url as return param
2024-04-23 17:14:34 -07:00
Nicolas
3abfd6b4c1
Update search.ts
2024-04-23 17:06:48 -07:00
Nicolas
53cc4c396f
Update search.ts
2024-04-23 17:05:58 -07:00
Nicolas
734c76fc56
Merge branch 'main' into nsc/mvp-search
2024-04-23 17:04:31 -07:00
Nicolas
f0695c7123
Update single_url.ts
2024-04-23 17:04:10 -07:00
Nicolas
4328a68ec1
Nick:
2024-04-23 16:57:53 -07:00
Nicolas
e6779aff68
Nick: tests
2024-04-23 16:56:09 -07:00
Nicolas
9ded75adb7
Merge branch 'main' into nsc/mvp-search
2024-04-23 16:52:40 -07:00
Nicolas
f3c190c21c
Nick:
2024-04-23 16:47:24 -07:00
Nicolas
41263bb4b6
Nick: serper support
2024-04-23 16:45:06 -07:00
Nicolas
8cb5d7955a
Update googlesearch.ts
2024-04-23 15:49:05 -07:00
Nicolas
495adc9a3f
Update googlesearch.ts
2024-04-23 15:48:37 -07:00
Nicolas
5e3e2ec966
Nick:
2024-04-23 15:44:11 -07:00
Nicolas
0146157876
Nick: mvp
2024-04-23 15:28:32 -07:00
rafaelsideguide
849c0b6ebf
[Feat] Added blocklist for social media urls
2024-04-23 18:50:35 -03:00
rafaelsideguide
9b01dc6281
Changed from active to waiting jobs
2024-04-23 16:07:22 -03:00
rafaelsideguide
a680c7ce84
[Feat] Server health check + slack message
2024-04-23 15:46:29 -03:00
Nicolas
306cfe4ce1
Nick:
2024-04-23 11:15:11 -07:00
Nicolas
357914c07d
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-04-23 10:55:42 -07:00
Nicolas
bf2df7a853
Nick: fix js-sdk
2024-04-23 10:55:40 -07:00
Nicolas
7bc7b179d4
Merge pull request #46 from mattzcarey/patch-1
...
chore: add context.close
2024-04-22 21:46:17 -07:00
Nicolas
de7e1f501b
Update openapi.json
2024-04-22 08:41:54 -07:00
Matt
572b7e8dc5
chore: add context.close
2024-04-22 16:38:05 +01:00
Nicolas
001bf0c504
Update package.json
2024-04-21 12:05:12 -07:00
Nicolas
6560c968e1
Update types.ts
2024-04-21 12:02:11 -07:00
Nicolas
52620bab16
Nick: prod and local-no-auth tests
2024-04-21 11:39:36 -07:00
Nicolas
749bd5f44d
Merge branch 'cjp/contributors-guide-and' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and
2024-04-21 11:27:37 -07:00
Nicolas
898d729a84
Nick: tests
2024-04-21 11:27:31 -07:00
Caleb Peffer
ef4ffd3a18
Adding contributors guide
2024-04-21 10:56:30 -07:00
Nicolas
5cdbf3a0ac
Nick: cleaner functions to handle authenticated requests that don't require ifs everywhere
2024-04-21 10:36:48 -07:00
Nicolas
aa89e2e8b5
Merge branch 'main' into cjp/contributors-guide-and
2024-04-21 10:10:05 -07:00
Caleb Peffer
be75aaa195
Caleb: first version of supabase proxy to make db authentication optional
2024-04-21 09:31:22 -07:00
Caleb Peffer
ad7951a679
Merge branch 'main' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and
2024-04-20 19:56:55 -07:00
Nicolas
d2f808a5fd
Update queue-worker.ts
2024-04-20 19:54:37 -07:00
Caleb Peffer
e6b46178dd
Caleb: added .env.example
2024-04-20 19:53:27 -07:00
Caleb Peffer
b361a76282
Caleb: added logging improvement
2024-04-20 19:53:04 -07:00
Nicolas
9b31e68a7e
Update queue-worker.ts
2024-04-20 19:38:44 -07:00
Nicolas
0db0874b00
Nick:
2024-04-20 19:37:45 -07:00
Nicolas
4543c57e4e
Nick:
2024-04-20 19:04:27 -07:00
Nicolas
5b8aed26dd
Update scrape.ts
2024-04-20 18:55:39 -07:00
Nicolas
23b2190e5d
Nick:
2024-04-20 16:38:05 -07:00
Nicolas
d201a4e58d
Merge pull request #31 from mendableai/feat/js-sdk-v0011
...
[Feat] Added type declarations
2024-04-20 14:15:16 -07:00
Nicolas
acec76680a
Merge pull request #35 from mendableai/nsc/job-logs
...
Better logging
2024-04-20 14:12:44 -07:00
Nicolas
5b3c75b06e
Nick:
2024-04-20 14:10:29 -07:00
Nicolas
43c2e877e7
Update index.ts
2024-04-20 14:05:01 -07:00
Nicolas
408c7a479f
Nick: rate limit fixes
2024-04-20 14:02:22 -07:00
Nicolas
6aa3cc3ce8
Nick:
2024-04-20 13:53:11 -07:00
Nicolas
1a3aa2999d
Nick: return only the list of urls
2024-04-20 11:59:42 -07:00
Nicolas
ddf9ff9c9a
Nick:
2024-04-20 11:46:06 -07:00
Nicolas
f1dd97af0f
Update index.ts
2024-04-19 15:37:27 -07:00
Nicolas
84cebf618b
Nick:
2024-04-19 15:36:00 -07:00
Nicolas
005ac8f839
Merge branch 'main' into detect-pdfs
2024-04-19 15:13:32 -07:00
Nicolas
5b93799149
Nick: a bit faster
2024-04-19 15:13:17 -07:00
rafaelsideguide
890bde686f
added type declarations
2024-04-19 19:10:05 -03:00
rafaelsideguide
37ef8a015c
fixing scrape preview test
2024-04-19 17:55:35 -03:00
Nicolas
c5cb268b61
Update pdfProcessor.ts
2024-04-19 13:13:42 -07:00
Nicolas
43cfcec326
Nick: disabling in crawl and sitemap for now
2024-04-19 13:12:08 -07:00
Nicolas
140529c609
Nick: fixes pdfs not found
2024-04-19 13:05:21 -07:00
Nicolas
15cfc01f5d
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-04-19 12:23:14 -07:00
Nicolas
a144e13e30
Update rate-limiter.ts
2024-04-19 12:23:13 -07:00
rafaelsideguide
384fb1db18
updating version
2024-04-19 15:27:54 -03:00
Rafael Miller
3c14b02f8b
Merge pull request #25 from mendableai/feat/replace-all-paths-to-absolute-paths
...
Added option to replace all relative paths with absolute paths
2024-04-19 15:18:50 -03:00
rafaelsideguide
3ddff62a56
adding better doc and types for js-sdk
2024-04-19 14:49:35 -03:00
Ikko Eltociear Ashimine
9e9d66f7a3
refactor: fix typo in WebScraper/index.ts
...
breakign -> breaking
2024-04-20 02:27:53 +09:00
rafaelsideguide
72e1dadccd
adding option to replace all relative paths with absolute paths
2024-04-19 11:47:20 -03:00
Nicolas
2c0660653d
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-04-18 13:56:25 -07:00
Nicolas
be35b32306
Nick: preview token tests
2024-04-18 13:55:55 -07:00
rafaelsideguide
c627d22179
all working now
2024-04-18 17:41:23 -03:00
rafaelsideguide
dab0568c43
testing tests
2024-04-18 17:38:12 -03:00
rafaelsideguide
3f833737f3
fixing test
2024-04-18 17:25:25 -03:00