Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Published by Scroll Versions from space DEV and version next

In the Library

D toc

Excerpt

Review the assets to which you have access in the Library for Data page.

Tip

Tip: If you land in an empty Library for Data page, you can

...

start adding datasets. Click Import Data. See Import Data Page.

Image Added

D caption
typefigure
Library Page

To create a new imported dataset, click Import Data. See Import Data Page.
Filter by type:

Click one of the pre-defined filters to show datasets of the following types:

...

 for Data Page

Tabs:

  • All Data: You can view all the imported datasets or references available to

...

  • you.
  • Imported Datasets:

...

D s product
rtrue

...

  • Review your imported datasets from sources such as file-based storage, connected databases, or desktop. 

    • The Source column indicates where the original source data is located.

  • References:

...

  • Reference datasets are created from a recipe's output. 
  • Macros: Macros are sequence of steps that can be

...

  • reused in other's recipe. 

...

Click one of the pre-defined filters to show datasets of the following types:Columns:

  • Name: Name of the objectasset.
  • In FlowsOwner:  Count Owner of flows in which the object is in use. the asset.

  • Source:  Flow or datastore Location where the object asset is located.
  • Last Updatedupdated:  Timestamp of the last time that the object asset was modified.

Actions:

  • Browse: If displayed, use the page browsing controls to explore the available objects.
  • Search: To search object names, enter a string in the search bar. Results are highlighted immediately in the Library page.
  • Sort: Click a column header to sort the display by the column's entries.

...

  • Details: Review details about the dataset. See See Dataset Details Page.
  • Preview: Inspect a preview of the dataset.

    Info

    NOTE: Preview is not available for binary format sources.

  • Wrangle Use in New Flownew flow(Imported dataset only) You can create a new flow and begin immediately wrangling the dataset. This step also creates a recipe in the flow.
  • Add to Flowflow:  Add the dataset to a new or existing flow.

  • Make a copy: Create a copy of the imported dataset. This option is not available for reference datasets. 

  • Edit name Edit name and description: Change the name and description of the dataset.
  • Edit data settings: If the source of the imported dataset required conversion to an internally supported format, you can modify settings related to that conversion process. For more information, see File Import Settings.

    Tip

    Tip: This setting applies primarily to binary file formats, such as PDF and Excel, or file formats that may require additional steps to convert into tabular data, such as JSON.

  • Refresh dataset: If available, this option refreshes the dataset's metadata with the latest source schema.

Info

NOTE: When a dataset is refreshed, all samples associated with the dataset are deleted, whether the dataset has changed. Samples must be recreated in their recipes.


Info

NOTE: If you attempt to refresh the schema of a parameterized dataset based on a set of files, only the schema for the first file is checked for changes. If changes are detected, the other files are contain those changes as well. This can lead to changes being assumed or undetected in later files and potential data corruption.

For more information, see Overview of Schema Management.

  • Delete Dataset: Delete Transfer ownership: For assets that you own, you can transfer ownership of them to another user. For more information, see Transfer Asset Ownership.
  • Delete dataset: Delete the dataset.

    Warning

    Deleting a dataset cannot be undone.

Imported Datasets

Info

NOTE: Youcan only see the imported datasets to which you have access in your currently selected project or workspace. If the data underlying the imported dataset is not available, the imported dataset is still listed in the page, since it is just a reference to the data.

To create a new imported dataset, click Import Data. For more information, see Import Data Page.

For more information, see Imported Datasets Page.

References

A reference dataset is a reference to a recipe's output. For more information, see References Page.

Info

NOTE:  A reference dataset is a read-only object where it is referenced. A reference dataset must be created in the source flow from the recipe to use.

A reference dataset is created from the context menu of a flow's recipe. 

Macros

A macro is a saved sequence of one or more recipe steps that can be reused in other recipes. See Macros Page.You can either import macros from your desktop or browse through the

D s platform
rtrue
 community page for existing macros. For more information, see Import Macro.
D s also
inCQLtrue
label((label = "library_page") or (label = "imported_dataset") or (label = "macro") or (label = "reference_dataset"))