Reve AI
리소스 마켓
MCP개발무료✦ Reve 추천

Jina Reader MCP

웹페이지를 LLM 친화적 마크다운으로. WebFetch보다 토큰 효율적.

💰 기본 무료, 대량 유료 11k

Local development

  • npm install
  • docker compose up -d
  • npm run init-db

VSCode F5 start debugger

OR after setting up appropriate environment variables:

  • docker compose up -d
  • npm run dev

Reader

Your LLMs deserve better input.

Reader does two things:

  • Read: It converts any URL to an LLM-friendly input with https://r.jina.ai/https://your.url. Get improved output for your agent and RAG systems at no cost.
  • Search: It searches the web for a given query with https://s.jina.ai/your+query. This allows your LLMs to access the latest world knowledge from the web.

Check out the live demo

Or just visit these URLs (Read) https://r.jina.ai/https://github.com/jina-ai/reader, (Search) https://s.jina.ai/Who%20will%20win%202024%20US%20presidential%20election%3F and see yourself.

Feel free to use Reader API in production. It is free, stable and scalable. We are maintaining it actively as one of the core products of Jina AI. Check out rate limit

Updates

  • 2024-10-08: Introduced an adaptive crawler. It can recursively crawl the website and extract the most relevant pages for a given webpage.
  • 2024-07-15: To restrict the results of s.jina.ai to certain domain/website, you can set e.g. site=jina.ai in the query parameters, which enables in-site search. For more options, try our updated live-demo.
  • 2024-07-01: We have resolved a DDoS attack and other traffic abusing since June 27th. We also found a bug introduced on June 28th which may cause higher latency for some websites. The attack and the bug have been solved; if you have experienced high latency of r.jina.ai between June 27th-30th, it should back to normal now.
  • 2024-05-30: Reader can now read abitrary PDF from any URL! Check out this PDF result from NASA.gov vs the original.
  • 2024-05-15: We introduced a new endpoint s.jina.ai that searches on the web and return top-5 results, each in a LLM-friendly format. Read more about this new feature here.
  • 2024-05-08: Image caption is off by default for better latency. To turn it on, set x-with-generated-alt: true in the request header.
  • 2024-05-03: We finally resolved a DDoS attack since April 29th. Now our API is much more reliable and scalable than ever!
  • 2024-04-24: You now have more fine-grained control over Reader API using headers, e.g. forwarding cookies, using HTTP proxy.
  • 2024-04-15: Reader now supports image reading! It captions all images at the specified URL and adds Image [idx]: [caption] as an alt tag (if they initially lack one). This enables downstream LLMs to interact with the images in reasoning, summarizing etc. See example here.

Usage

Using r.jina.ai for single URL fetching

Simply prepend https://r.jina.ai/ to any URL. For example, to convert the URL https://en.wikipedia.org/wiki/Artificial_intelligence to an LLM-friendly input, use the following URL:

https://r.jina.ai/https://en.wikipedia.org/wiki/Artificial_intelligence

Using r.jina.ai for a full website fetching (Google Colab)


GitHub에서 전체 내용 보기