SSIS Azure Data Lake Store Destination Mapping Problem | Telefónica Tech

Company insights and events

Discover what's happening in our business, including corporate events, reports, press releases and insights.

Leadership team

Our key people

Meet our management, board members and team leaders and find out how they can help you.

Trusted resources and platforms

We work with the best names in the business, ensuring we can provide the highest quality services to our clients.

Working at Telefónica Tech

Join our team

Check out our current career opportunities and our company ethos and benefits.

Accreditations & awards

Industry expertise and recognition

Our extensive accreditations and awards provide our customers with assurance of best practise.

Social responsibility

Supporting colleagues and our communities

We put CSR at the core of our business, to support our colleagues, our customers and our communities and to enhance lives.

Contact our experts

Data-driven digital transformation

Harnessing the power of data and AI to drive growth and delight your customers.

Protect your data & tech

We help prevent cyber attacks to ensure your business remains safe, compliant and efficient.

Digital infrastructure that works for you

We are experts in complex digital transformation projects, so we can help you modernise your business beyond just cloud.

Business Applications

Delivering the digital enterprise

Transforming organisations, step-by-step, with connected intelligent applications and platforms.

Digital Workplace

Frictionless Collaboration, secure systems

We provide systems that make anytime, anywhere working a seamless experience.

Our solutions

Resilient digital solutions for essential services

We provide secure, scalable systems to help public services operate effectively and meet modern expectations.

Connected, secure healthcare solutions

Our systems support safe, compliant, and patient-focused care with seamless data integration.

Digital infrastructure for community safety

We deliver secure, efficient solutions to enhance police response and data protection.

Financial Services & Insurance

Secure, data-led resilience

We support financial institutions with robust, compliant systems to meet a dynamic market's needs.

Enhancing operations with data and AI

We help manufacturers streamline processes, boost efficiency, and drive innovation.

Intelligent solutions for modern retail

Our tools enable personalised customer experiences, operational efficiency, and growth through data.

Industries

Featured Events

Join, learn and connect.

Discover our upcoming events, webinars,
and conferences.

News that matters.

Stay informed with the latest company news
and press releases.

Thought Leadership Articles

Leading the conversation.

Expert insights on navigating industry challenges and innovative solutions.

Technical Blogs

Tech explained.

Comprehensive guides and detailed analysis from our technical experts.

Articles

SSIS Azure Data Lake Store Destination Mapping Problem

Blog | 6 October 2017 | Zach Stagers

The Problem

My current project involves Azure’s Data Lake Store and Analytics. We’re using the SSIS Azure Feature Pack‘s Azure Data Lake Store Destination to move data from our clients on premise system into the Lake, then using U-SQL to generate a delta file which goes on to be loaded into the warehouse. U-SQL is a “schema-on-read” language, which means you need a consistent and predictable format to be able to define the schema as you pull data out.

We ran in to an issue with this schema-on-read approach, but once you understand the issue, it’s simple to rectify. The Data Lake Store Destination task does not use the same column ordering which is shown in the destination mapping. Instead, it appears to rely on an underlying column identifier. This means that if you apply any conversions to a column in the data flow, this column will automatically be placed at the end of file- taking away the predictability of the file format, and potentially making your schema inconsistent if you have historic data in the Lake.

An Example

Create a simple package which pulls data from a flat file and moves it into the Lake.

Mappings of the Destination are as follows:

Running the package, and viewing the file in the Lake gives us the following (as we’d expect, based on the mappings):

Now add a conversion task – the one in my package just converts Col2 to a DT_I4, update the mappings in the destination, and run the package.

Open the file up in the Lake again, and you’ll find that Col2 is now at the end and contains the name of the input column, not the destination column:

The Fix

As mention in my “The Problem” section, the fix is extremely simple – just handle it in your U-SQL by re-ordering the columns appropriately during extraction! This article is more about giving a heads up and highlighting the problem, than a mind-blowing solution.