Commit Graph

1281 Commits

Author SHA1 Message Date
rafaelsideguide 7a61325500 map + search + scrape markdown bug 2024-08-16 17:57:11 -03:00
Gergő Móricz 3fcb21930e remove log 2024-08-16 22:48:23 +02:00
Gergő Móricz f20328bdbb crawl status and document stuff 2024-08-16 22:48:05 +02:00
Nicolas 0c057bb649 Update index.test.ts 2024-08-16 16:45:10 -04:00
Nicolas b32464558a Update index.test.ts 2024-08-16 16:41:09 -04:00
Nicolas 5bac7988a6 Update index.test.ts 2024-08-16 16:08:38 -04:00
Nicolas 290c7ee936 Update index.test.ts 2024-08-16 16:06:46 -04:00
Nicolas 23a033fe61 Nick: fixes and more e2e tests 2024-08-16 16:03:35 -04:00
Nicolas 37ae9a9043 Update index.test.ts 2024-08-16 14:17:43 -04:00
Nicolas 200ce8e2ce Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-16 14:16:35 -04:00
Nicolas 21d3798e49 Nick: initial e2e v1 tests for /scrape 2024-08-16 14:16:30 -04:00
rafaelsideguide 3f998b688d scrape ready 2024-08-16 15:14:37 -03:00
Nicolas b0d211ecc1 Merge branch 'main' into v1-webscraper 2024-08-16 13:43:28 -04:00
Gergő Móricz fd6432e7fd fix(queue-worker): correct job success 2024-08-16 19:16:08 +02:00
Gergő Móricz 6e54942265 fix(queue-worker): add cancelled to crawl log 2024-08-16 19:11:53 +02:00
rafaelsideguide 9b1cb266a0 added origin to request types 2024-08-16 13:49:50 -03:00
Gergő Móricz d0a8382a5b fix(queue-worker): crawl finishing race condition 2024-08-16 18:48:52 +02:00
Gergő Móricz 6bd52e63bf fix(queue-worker): fix linksOnPage undefined error 2024-08-16 18:42:24 +02:00
Gergő Móricz 5a6570cba2 fix(webhooks): call back with parent crawl ID 2024-08-16 17:42:42 +02:00
rafaelsideguide 7d324bd2c8 Create checkCredits.ts 2024-08-16 11:21:52 -03:00
Nicolas ec361609d2 Nick: added growth-2x plan 2024-08-15 18:37:19 -04:00
Gergő Móricz 8b7569f8f3 add zod, create middleware, update openapi declaration, add crawl logic 2024-08-15 23:30:33 +02:00
Gergő Móricz 4165de1773 v1 restructure 2024-08-15 21:51:59 +02:00
Gergő Móricz af08ab0b1a fix bad module resolution 2024-08-15 21:51:52 +02:00
Nicolas c917c8fbcd Merge branch 'main' into v1-webscraper 2024-08-15 15:14:29 -04:00
Nicolas 32c6b1f136 Nick: remove active job alerts 2024-08-15 14:50:30 -04:00
Gergő Móricz 0c14366720 fix: add checkandupdateurl to crawlPreview 2024-08-15 20:30:25 +02:00
Nicolas 81b2479db3 Merge pull request #459 from mendableai/feat/queue-scrapes feat: Move scraper to queue 2024-08-15 14:19:55 -04:00
Gergő Móricz fc08ff450d search port 2024-08-15 20:10:59 +02:00
Nicolas 86326f34e9 Update single_url.test.ts 2024-08-15 13:48:42 -04:00
Gergő Móricz 129a882bcc fix(scrape): give scrapes their real job id 2024-08-15 19:29:47 +02:00
Gergő Móricz 965a5817d1 fix(queue-worker): log jobs correctly 2024-08-15 19:27:15 +02:00
Gergő Móricz dad9d353d9 use thomas's url validation 2024-08-15 19:19:02 +02:00
Gergő Móricz e3279274f1 fix: make playground crawl work 2024-08-15 19:14:32 +02:00
Gergő Móricz c5597bc722 fix: robots.txt laoding 2024-08-15 19:11:07 +02:00
Gergő Móricz 29f0d9ec94 propagate priority to fire-engine 2024-08-15 19:04:46 +02:00
Gergő Móricz b79d3d1754 fix 2024-08-15 19:02:05 +02:00
Gergő Móricz 57730f6a35 priority changes 2024-08-15 18:58:07 +02:00
Gergő Móricz 846610681b fix: fix posthog, add dummy crawl DB items 2024-08-15 18:55:18 +02:00
Nicolas 6e1074cdd1 Update website_params.ts 2024-08-14 17:39:54 -04:00
Thomas Kosmas 6410e1a81d Update params 2024-08-15 00:10:14 +03:00
Gergő Móricz 8a5cad72f6 fix(queue-worker): variable name collision 2024-08-14 22:02:05 +02:00
Gergő Móricz b8ec40dd72 fix(crawl): submit sitemapped jobs in bulk 2024-08-14 20:34:19 +02:00
Gergő Móricz 2ca1017fc3 fix(crawl): make request 0 of crawl jobs higher priority 2024-08-14 19:34:18 +02:00
Gergő Móricz f4466f6bb0 fix(test-suite): add artillery 2024-08-14 19:33:09 +02:00
Gergő Móricz cfad067a63 fix(fly): change proxy limits 2024-08-14 18:52:40 +02:00
Gergő Móricz a6c81f9d62 fix: return all data when calling webhook 2024-08-14 17:53:47 +02:00
Rafael Miller 84a2fe8697 Merge pull request #537 from mendableai/feat/go-sdk-submodule [Feat] Added go-sdk as submodule 2024-08-14 08:25:24 -03:00
rafaelsideguide f86d2bb291 added go-sdk as submodule 2024-08-13 18:17:35 -03:00
Nicolas e2472b9b0d Merge remote-tracking branch 'origin/v1/mockup-controllers' into v1-webscraper 2024-08-13 16:34:57 -04:00