google-drive.mdx 1.2 KB

12345678910111213141516171819202122232425262728
  1. ---
  2. title: 'Google Drive'
  3. ---
  4. To use GoogleDriveLoader you must install the extra dependencies with `pip install --upgrade embedchain[googledrive]`.
  5. The data_type must be `google_drive`. Otherwise, it will be considered a regular web page.
  6. Google Drive requires the setup of credentials. This can be done by following the steps below:
  7. 1. Go to the [Google Cloud Console](https://console.cloud.google.com/apis/credentials).
  8. 2. Create a project if you don't have one already.
  9. 3. Enable the [Google Drive API](https://console.cloud.google.com/flows/enableapi?apiid=drive.googleapis.com)
  10. 4. [Authorize credentials for desktop app](https://developers.google.com/drive/api/quickstart/python#authorize_credentials_for_a_desktop_application)
  11. 5. When done, you will be able to download the credentials in `json` format. Rename the downloaded file to `credentials.json` and save it in `~/.credentials/credentials.json`
  12. 6. Set the environment variable `GOOGLE_APPLICATION_CREDENTIALS=~/.credentials/credentials.json`
  13. The first time you use the loader, you will be prompted to enter your Google account credentials.
  14. ```python
  15. from embedchain import Pipeline as App
  16. app = App()
  17. url = "https://drive.google.com/drive/u/0/folders/xxx-xxx"
  18. app.add(url, data_type="google_drive")
  19. ```