Gergo Moricz
|
7e3a368684
|
fix: unpause globally
|
2024-07-12 00:05:35 +02:00 |
|
Gergo Moricz
|
ee1d41406e
|
feat: unpause by http request
|
2024-07-11 23:56:36 +02:00 |
|
Gergo Moricz
|
f64a2d8668
|
fix: rename fly tomls to original
|
2024-07-11 23:21:02 +02:00 |
|
Gergo Moricz
|
bd84290b9e
|
fix: reenable hyperdx
|
2024-07-11 23:20:51 +02:00 |
|
Gergo Moricz
|
09bca05b20
|
feat: fix iteration 3 (actually works)
|
2024-07-11 23:14:15 +02:00 |
|
Gergo Moricz
|
9cd7d79b64
|
feat: avoid double SIGINT crashing
|
2024-07-11 20:35:15 +02:00 |
|
Gergo Moricz
|
eaa8db4b19
|
fix(fly): raise kill timeout for graceful shutdown
|
2024-07-11 20:09:06 +02:00 |
|
Gergo Moricz
|
bffb9f8fd0
|
feat: stuck job restoration iteration 2
|
2024-07-11 20:08:21 +02:00 |
|
rafaelsideguide
|
86d0e88a91
|
removed hyperdx (they also have graceful shutdown) and tried to change the process for running on server. It didn't work.
|
2024-07-10 18:29:55 -03:00 |
|
rafaelsideguide
|
7c3cc89a80
|
Merge branch 'feat/save-docs-on-supabase' of https://github.com/mendableai/firecrawl into feat/save-docs-on-supabase
|
2024-07-09 18:48:29 -03:00 |
|
Gergo Moricz
|
1a07e9d23b
|
feat: pick up and commit interrupted jobs from/to DB
|
2024-07-09 15:57:38 +02:00 |
|
Gergo Moricz
|
6a524e1bae
|
feat: pick up and commit interrupted jobs from/to DB
|
2024-07-09 14:56:47 +02:00 |
|
Gergo Moricz
|
77aa46588f
|
feat: graceful exit handler
|
2024-07-09 14:29:32 +02:00 |
|
Nicolas
|
914897c9d2
|
Merge branch 'main' into feat/save-docs-on-supabase
|
2024-07-05 12:27:22 -03:00 |
|
rafaelsideguide
|
d4e1a9724f
|
Merge branch 'dependabot/npm_and_yarn/apps/test-suite/dev-deps-ffe2a14739'
|
2024-07-05 11:16:49 -03:00 |
|
Rafael Miller
|
c570fa92cf
|
Merge pull request #347 from mendableai/dependabot/npm_and_yarn/apps/test-suite/prod-deps-d16537e256
apps/test-suite(deps): bump the prod-deps group in /apps/test-suite with 6 updates
|
2024-07-05 10:18:35 -03:00 |
|
Nicolas
|
a7aaa7e57e
|
Update SELF_HOST.md
|
2024-07-04 17:49:09 -03:00 |
|
Nicolas
|
5551f704ba
|
Merge pull request #362 from snippet/self-host-docs-ts-playwright-service
(Docs) Self Host added new ts playwright service instructions
|
2024-07-04 17:48:41 -03:00 |
|
Nicolas
|
8f46b8218a
|
Merge pull request #361 from snippet/ts-playwright-service-docker
setting up docker to ts playwright service
|
2024-07-04 17:47:41 -03:00 |
|
Nicolas
|
32849b017f
|
Nick:
|
2024-07-03 20:18:11 -03:00 |
|
Nicolas
|
5ecd9cb6f5
|
Merge pull request #363 from mendableai/nsc/logging-scrapers
Logging for all scraper methods
|
2024-07-03 18:47:22 -03:00 |
|
Nicolas
|
066d92f643
|
Update single_url.ts
|
2024-07-03 18:38:17 -03:00 |
|
Nicolas
|
f5b2fbd7e8
|
Nick: revision
|
2024-07-03 18:06:53 -03:00 |
|
Nicolas
|
2d30cc6117
|
Nick: comments
|
2024-07-03 18:01:54 -03:00 |
|
Nicolas
|
90c54c32fd
|
Nick: refactor
|
2024-07-03 18:01:17 -03:00 |
|
Nicolas
|
90cf799a3c
|
Update single_url.ts
|
2024-07-03 17:56:21 -03:00 |
|
Nicolas
|
b36406e465
|
Nick: log scrpaers
|
2024-07-03 17:28:53 -03:00 |
|
Jeff Pereira
|
8d09c5f9b5
|
(Docs) Self Host added new ts playwright service instructions
|
2024-07-03 12:00:44 -07:00 |
|
Jeff Pereira
|
b4292c1ea3
|
setting up docker to ts playwright service
|
2024-07-03 11:55:39 -07:00 |
|
Nicolas
|
abb44bb112
|
Merge pull request #346 from mendableai/dependabot/pip/apps/playwright-service/prod-deps-8f04296377
apps/playwright-service(deps): bump the prod-deps group in /apps/playwright-service with 3 updates
|
2024-07-03 01:07:09 -03:00 |
|
Nicolas
|
f967daddcb
|
Merge pull request #325 from snippet/playwright-scraper-api
new playwright service
|
2024-07-03 01:04:52 -03:00 |
|
Eric Ciarla
|
2d0d5ac392
|
Update for llm-extraction-from-raw-html
|
2024-07-02 14:05:42 -04:00 |
|
rafaelsideguide
|
0175152577
|
Fixed PDF match custom scraping
Now it's working for both `https://getgc.ai/privacy` and `https://prairie.cards/products/wood-designs` usecases.
|
2024-07-02 11:25:17 -03:00 |
|
rafaelsideguide
|
96de948d6b
|
Update index.test.ts
|
2024-07-02 11:04:09 -03:00 |
|
rafaelsideguide
|
7b7154ba1e
|
bugfixed pageStatusCode
|
2024-07-02 10:51:35 -03:00 |
|
Rafael Miller
|
50eecf04a9
|
Update licence pyproject.toml
Closes #345
|
2024-07-02 10:01:49 -03:00 |
|
dependabot[bot]
|
5bda5ec81d
|
apps/test-suite(deps-dev): bump typescript
Bumps the dev-deps group in /apps/test-suite with 1 update: [typescript](https://github.com/Microsoft/TypeScript).
Updates `typescript` from 5.4.5 to 5.5.3
- [Release notes](https://github.com/Microsoft/TypeScript/releases)
- [Changelog](https://github.com/microsoft/TypeScript/blob/main/azure-pipelines.release.yml)
- [Commits](https://github.com/Microsoft/TypeScript/compare/v5.4.5...v5.5.3)
---
updated-dependencies:
- dependency-name: typescript
dependency-type: direct:development
update-type: version-update:semver-minor
dependency-group: dev-deps
...
Signed-off-by: dependabot[bot] <support@github.com>
|
2024-07-02 12:48:10 +00:00 |
|
dependabot[bot]
|
ad3e73b445
|
apps/test-suite(deps): bump the prod-deps group
Bumps the prod-deps group in /apps/test-suite with 6 updates:
| Package | From | To |
| --- | --- | --- |
| [@anthropic-ai/sdk](https://github.com/anthropics/anthropic-sdk-typescript) | `0.20.8` | `0.24.3` |
| [@dqbd/tiktoken](https://github.com/dqbd/tiktoken) | `1.0.14` | `1.0.15` |
| [@supabase/supabase-js](https://github.com/supabase/supabase-js) | `2.43.1` | `2.44.2` |
| [openai](https://github.com/openai/openai-node) | `4.40.2` | `4.52.2` |
| [playwright](https://github.com/microsoft/playwright) | `1.43.1` | `1.45.0` |
| [ts-jest](https://github.com/kulshekhar/ts-jest) | `29.1.2` | `29.1.5` |
Updates `@anthropic-ai/sdk` from 0.20.8 to 0.24.3
- [Release notes](https://github.com/anthropics/anthropic-sdk-typescript/releases)
- [Changelog](https://github.com/anthropics/anthropic-sdk-typescript/blob/main/CHANGELOG.md)
- [Commits](https://github.com/anthropics/anthropic-sdk-typescript/compare/sdk-v0.20.8...sdk-v0.24.3)
Updates `@dqbd/tiktoken` from 1.0.14 to 1.0.15
- [Release notes](https://github.com/dqbd/tiktoken/releases)
- [Changelog](https://github.com/dqbd/tiktoken/blob/main/CHANGELOG.md)
- [Commits](https://github.com/dqbd/tiktoken/compare/@dqbd/tiktoken@1.0.14...@dqbd/tiktoken@1.0.15)
Updates `@supabase/supabase-js` from 2.43.1 to 2.44.2
- [Release notes](https://github.com/supabase/supabase-js/releases)
- [Changelog](https://github.com/supabase/supabase-js/blob/master/RELEASE.md)
- [Commits](https://github.com/supabase/supabase-js/compare/v2.43.1...v2.44.2)
Updates `openai` from 4.40.2 to 4.52.2
- [Release notes](https://github.com/openai/openai-node/releases)
- [Changelog](https://github.com/openai/openai-node/blob/master/CHANGELOG.md)
- [Commits](https://github.com/openai/openai-node/compare/v4.40.2...v4.52.2)
Updates `playwright` from 1.43.1 to 1.45.0
- [Release notes](https://github.com/microsoft/playwright/releases)
- [Commits](https://github.com/microsoft/playwright/compare/v1.43.1...v1.45.0)
Updates `ts-jest` from 29.1.2 to 29.1.5
- [Release notes](https://github.com/kulshekhar/ts-jest/releases)
- [Changelog](https://github.com/kulshekhar/ts-jest/blob/main/CHANGELOG.md)
- [Commits](https://github.com/kulshekhar/ts-jest/compare/v29.1.2...v29.1.5)
---
updated-dependencies:
- dependency-name: "@anthropic-ai/sdk"
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: "@dqbd/tiktoken"
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: prod-deps
- dependency-name: "@supabase/supabase-js"
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: openai
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: playwright
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: ts-jest
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: prod-deps
...
Signed-off-by: dependabot[bot] <support@github.com>
|
2024-07-02 12:47:58 +00:00 |
|
dependabot[bot]
|
60de6bb6e3
|
apps/playwright-service(deps): bump the prod-deps group
Bumps the prod-deps group in /apps/playwright-service with 3 updates: [hypercorn](https://github.com/pgjones/hypercorn), [fastapi](https://github.com/tiangolo/fastapi) and [playwright](https://github.com/Microsoft/playwright-python).
Updates `hypercorn` from 0.16.0 to 0.17.3
- [Changelog](https://github.com/pgjones/hypercorn/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pgjones/hypercorn/compare/0.16.0...0.17.3)
Updates `fastapi` from 0.110.0 to 0.111.0
- [Release notes](https://github.com/tiangolo/fastapi/releases)
- [Commits](https://github.com/tiangolo/fastapi/compare/0.110.0...0.111.0)
Updates `playwright` from 1.42.0 to 1.44.0
- [Release notes](https://github.com/Microsoft/playwright-python/releases)
- [Commits](https://github.com/Microsoft/playwright-python/compare/v1.42.0...v1.44.0)
---
updated-dependencies:
- dependency-name: hypercorn
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: fastapi
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
- dependency-name: playwright
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: prod-deps
...
Signed-off-by: dependabot[bot] <support@github.com>
|
2024-07-02 12:47:09 +00:00 |
|
Rafael Miller
|
3d530b461b
|
Merge pull request #337 from Sanix-Darker/f/cleaner-docker-compose
[PROPOSAL] (docker-compose) regroup envs vars between services
|
2024-07-02 09:46:22 -03:00 |
|
Rafael Miller
|
46ddc813e0
|
Merge pull request #338 from Sanix-Darker/dependabot
[PROPOSAL] (deps): making sure all deps are always up to date
|
2024-07-02 09:46:08 -03:00 |
|
Rafael Miller
|
f0f449fe51
|
Merge pull request #336 from snippet/allow-external-content-links
[Proposal] new feature allowExternalContentLinks
|
2024-07-02 09:45:21 -03:00 |
|
rafaelsideguide
|
db4a743365
|
Added e2e test
|
2024-07-02 09:44:08 -03:00 |
|
Eric Ciarla
|
0821017f5b
|
Update README.md
|
2024-07-02 07:08:46 -04:00 |
|
Nicolas
|
42cd58a679
|
Merge pull request #332 from mendableai/feat/rawHtmlExtraction
Adds pageOptions.includeRawHtml and new extraction mode "llm-extraction-from-raw-html"
|
2024-07-01 18:23:26 -03:00 |
|
Nicolas
|
c4f423981f
|
Update pnpm-lock.yaml
|
2024-07-01 18:22:22 -03:00 |
|
rafaelsideguide
|
16aac7f8c5
|
Update single_url.ts
|
2024-07-01 18:21:15 -03:00 |
|
Nicolas
|
6d0c7a9ccd
|
Merge pull request #323 from mendableai/tests/crawl-limit-unit-tests
[Tests] Added crawl limit unit test
|
2024-07-01 17:56:04 -03:00 |
|
rafaelsideguide
|
4d6e25619b
|
minor spacing and comment stuff
|
2024-07-01 16:05:34 -03:00 |
|
Eric Ciarla
|
e1af815f8c
|
Update scrape.ts
|
2024-07-01 08:48:21 -04:00 |
|