暫無描述

Sidharth Mohanty e8a2846449 Improve and add more tests (#807) 1 年之前
.github 4a8c50f886 [docs]: Revamp embedchain docs (#799) 1 年之前
configs adf50f1e81 [Docs] Add docs for Azure OpenAI provider (#804) 1 年之前
docs adf50f1e81 [Docs] Add docs for Azure OpenAI provider (#804) 1 年之前
embedchain e8a2846449 Improve and add more tests (#807) 1 年之前
embedchain-js 3c3d98b9c3 feat: add embedchain javascript package (#576) 1 年之前
examples 77c90a308e [Docs]: Clean up docs (#802) 1 年之前
notebooks adf50f1e81 [Docs] Add docs for Azure OpenAI provider (#804) 1 年之前
tests e8a2846449 Improve and add more tests (#807) 1 年之前
.env.example 702067e521 Improve user readability of .env.example (#781) 1 年之前
.gitignore adf50f1e81 [Docs] Add docs for Azure OpenAI provider (#804) 1 年之前
.pre-commit-config.yaml ac68986404 Add project tools and contributing guidelines (#281) 2 年之前
CITATION.cff 736b645fea Add citation (#137) 2 年之前
CONTRIBUTING.md c595003481 docs: setup docs for embedchain (#287) 2 年之前
LICENSE 65d1ff37e8 Create LICENSE 2 年之前
Makefile a86d7f52e9 [Feature]: Add support for creating app using yaml config (#787) 1 年之前
README.md 77c90a308e [Docs]: Clean up docs (#802) 1 年之前
image.jpg d0af018b8d Add support for image dataset (#571) 1 年之前
poetry.toml ac68986404 Add project tools and contributing guidelines (#281) 2 年之前
pyproject.toml e8a2846449 Improve and add more tests (#807) 1 年之前

README.md

embedchain

PyPI Slack Discord Twitter Substack Open in Colab codecov

Embedchain is a Data Platform for LLMs - load, index, retrieve, and sync any unstructured data. Using embedchain, you can easily create LLM powered apps over any data. If you want a javascript version, check out embedchain-js

Community

  • Join embedchain community on slack by accepting this invite

🤝 Schedule a 1-on-1 Session

Book a 1-on-1 Session with Taranjeet, the founder, to discuss any issues, provide feedback, or explore how we can improve Embedchain for you.

🔧 Quick install

pip install --upgrade embedchain

🔍 Demo

Try out embedchain in your browser:

Open in Colab

📖 Documentation

The documentation for embedchain can be found at docs.embedchain.ai.

💻 Usage

Embedchain empowers you to create ChatGPT like apps, on your own dynamic dataset.

Data Types Supported

  • Youtube video
  • PDF file
  • Web page
  • Sitemap
  • Doc file
  • Code documentation website loader
  • Notion and many more.

You can find the full list of data types on our documentation.

Queries

For example, you can use Embedchain to create an Elon Musk bot using the following code:

import os
from embedchain import App

# Create a bot instance
os.environ["OPENAI_API_KEY"] = "YOUR API KEY"
elon_bot = App()

# Embed online resources
elon_bot.add("https://en.wikipedia.org/wiki/Elon_Musk")
elon_bot.add("https://www.forbes.com/profile/elon-musk")
elon_bot.add("https://www.youtube.com/watch?v=RcYjXbSJBN8")

# Query the bot
elon_bot.query("How many companies does Elon Musk run and name those?")
# Answer: Elon Musk currently runs several companies. As of my knowledge, he is the CEO and lead designer of SpaceX, the CEO and product architect of Tesla, Inc., the CEO and founder of Neuralink, and the CEO and founder of The Boring Company. However, please note that this information may change over time, so it's always good to verify the latest updates.

🤝 Contributing

Contributions are welcome! Please check out the issues on the repository, and feel free to open a pull request. For more information, please see the contributing guidelines.

For more reference, please go through Development Guide and Documentation Guide.

Telemetry

We collect anonymous usage metrics to enhance our package's quality and user experience. This includes data like feature usage frequency and system info, but never personal details. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the app.config.collect_metrics = False in the code. We prioritize data security and don't share this data externally.

Citation

If you utilize this repository, please consider citing it with:

@misc{embedchain,
  author = {Taranjeet Singh, Deshraj Yadav},
  title = {Embedchain: Data platform for LLMs - load, index, retrieve, and sync any unstructured data},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/embedchain/embedchain}},
}