> For the complete documentation index, see [llms.txt](https://2435ghj424g6j.gitbook.io/gm/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://2435ghj424g6j.gitbook.io/gm/application/datalake.md).

# Datalake

**Existing Collections**

* **Collection Cards**: Each collection is represented by a card displaying essential information.
  * *Details*: Name of the collection, brief description, blockchain network, collection format, and visibility.
  * *Interaction*: Clicking on a collection card navigates to the collection's dedicated page.

### Explore Further

For additional information on viewing existing collections, please consult the [View Collection Page](https://github.com/Offsetdata/OD-Docs/blob/main/application/datalake/view-collection.md).

**Create New Collection**

* **Button**: Clicking the "Create New Collection" card opens a modal titled "Create New Data Collection."

**"Create New Data Collection" Modal**

* **Form Fields**:
  * *Collection Name*: Text input field to specify the name of the new collection.
  * *Description*: Text area to provide detailed information about the collection.
    * *Example Description*: "The IoT (m6) London LT56TGH dataset provides real-time and historical data on travel patterns, tourism, and transportation usage in London spanning 2019-2023.\
      The core dataset includes over 5 billion timestamped records across thousands of sensor streams. Key metrics tracked relate to:

      *Transportation usage*: Passenger levels on Tube lines, buses, trains, airports, taxis, roads. Speed/congestion data for roads/tubes. Origin-Destination flows.

      *Tourism activity*: Foot traffic in zones, visitor numbers and wait times at attractions. Hotel occupancy rates and bookings. Restaurant guest numbers and peak hours.
  * *Category*: Dropdown menu to select the category (e.g., Travel and Tourism Data).
  * *Blockchain Network*: Selection field for choosing the blockchain network (e.g., Polygon, Camino, BNB).
* **Submit Button**: Upon completing the form, the "Submit" button creates the new collection.
  * *Functionality*: Validates the input, creates a new collection on the selected blockchain, and dismisses the modal.

### Explore Further

For additional information on creating a new collection, please consult the [Create Collection Page](https://github.com/Offsetdata/OD-Docs/blob/main/application/datalake/new-collection.md).

## Summary

We've provided a brief overview of the functionality and options available in the Data Lake feature.

Within the Data Lake, users can view and edit existing collections and create new ones.

The Data Lake is ideal for one-time data selling or subscription-based usage scenarios, where data sellers can regularly update datasets with fresh data, granting buyers instant access to the latest information.

In the Data Lake, sellers can upload data via:

* **Files:** such as csv, xlsx, or json.
* **API Data Stream:** From within third-party applications that have access to [Offsetdata's API Endpoints](https://github.com/Offsetdata/OD-Docs/blob/main/OD-Docs/api/endpoints.md).


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://2435ghj424g6j.gitbook.io/gm/application/datalake.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
