Azure Data Lake Storage Gen2 Python SDK

by Unknown v1.0.0

This skill provides access to the Azure Data Lake Storage Gen2 SDK for Python. It allows users to interact with a hierarchical file system optimized for big data analytics. You can perform various operations such as creating, reading, updating, and deleting files and directories, managing access control lists (ACLs), and listing the contents of file systems.

Leverage this skill to build data pipelines, process large datasets, and manage your data lake storage. It supports both synchronous and asynchronous operations, providing flexibility for different application requirements. The skill also includes best practices for efficient data management and performance optimization.

What It Does

Enables interaction with Azure Data Lake Storage Gen2, allowing users to manage files, directories, and access control using Python.

When To Use

When you need to programmatically interact with Azure Data Lake Storage Gen2 for big data analytics, data processing, or file management tasks.

Inputs

Azure Storage Account URL, credentials (e.g., DefaultAzureCredential), file system names, directory paths, file paths, and data to upload.

Outputs

File system objects, directory objects, file objects, file content, properties, access control lists, metadata, and lists of paths.

Limitations

Requires an active Azure subscription and appropriate permissions to access Azure Data Lake Storage Gen2. Handling extremely large files might require careful memory management.

Installation

1. Open Cursor IDE.
2. Create a new Python project.
3. Install the Azure Data Lake Storage Gen2 SDK: `pip install azure-storage-file-datalake azure-identity`.
4. Import the required modules and use the provided code snippets to interact with your Azure Data Lake Storage Gen2 account.
5. Set the AZURE_STORAGE_ACCOUNT_URL environment variable.

View Cursor IDE documentation

1. Install Python 3.6 or later.
2. Install the Azure Data Lake Storage Gen2 SDK and Azure Identity library: `pip install azure-storage-file-datalake azure-identity`.
3. Set the `AZURE_STORAGE_ACCOUNT_URL` environment variable to point to your Azure Data Lake Storage Gen2 account endpoint.
4. Authenticate using `DefaultAzureCredential` or other appropriate credential types.
5. Use the provided code examples to perform operations on your data lake.

View Aider documentation

1. Install Python 3.6 or later.
2. Install the Azure Data Lake Storage Gen2 SDK and Azure Identity library: `pip install azure-storage-file-datalake azure-identity`.
3. Set the `AZURE_STORAGE_ACCOUNT_URL` environment variable to point to your Azure Data Lake Storage Gen2 account endpoint.
4. Authenticate using `DefaultAzureCredential` or other appropriate credential types.
5. Use the provided code examples to perform operations on your data lake.

View Claude (Anthropic) documentation

Have a Skill to Share?

Join the community and help AI agents learn new capabilities. Submit your skill and reach thousands of developers.