I am working on a new Dataflow Gen 2 to import Excel files from SharePoint.
The problem I am running into is that our vendor, who supplies the files, is not properly setting the worksheet dimension element in the XML of the XLSX file.
The file as downloaded from our vendor's site has excel.zip > xl\worksheets\sheet1.xml defined as <dimension ref="A1:A56812" />.
The Dataflow Gen 2 reads that and only shows the first column.
However, there are columns A - Y in the sheet. If we open the sheet in Excel, change any of the data and save the file, the dimension is defined as <dimension ref="A1:Y56812" /> and the Dataflow Gen 2 pulls in all the columns.
Is there a way to force the Dataflow Gen 2 to ignore the dimension attribute and import columns A - Y?
Or do I need to have "Open the sheet, change something, save the sheet" as part of the manual download tasks? I want to automate as much as possible.
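For what it's worth, the "open, change, save" step could probably be scripted rather than done by hand. Below is a minimal Python sketch, not a tested production fix. It assumes a preprocessing step can run on the file before the dataflow reads it, that every cell element carries an `r="A1"`-style reference, that the sheet XML uses no namespace prefix, and that the sheet lives at xl/worksheets/sheet1.xml as in the file above. The function names (`actual_dimension`, `fix_dimension`, `repair_xlsx`) are made up for illustration. It scans the cell references to find the real used range and rewrites the `<dimension>` element to match.

```python
import re
import zipfile
from typing import Optional

# Matches cell elements such as <c r="Y12" ...>; assumes cells carry an r attribute.
CELL_REF = re.compile(rb'<c\b[^>]*?\br="([A-Z]+)(\d+)"')
# Matches a self-closing dimension element such as <dimension ref="A1:A56812" />.
DIMENSION = re.compile(rb'<dimension\b[^>]*?/>')


def col_to_num(col: str) -> int:
    """Convert a column label (A, Z, AA, ...) to a 1-based index."""
    n = 0
    for ch in col:
        n = n * 26 + (ord(ch) - ord("A") + 1)
    return n


def num_to_col(n: int) -> str:
    """Convert a 1-based column index back to a column label."""
    label = ""
    while n > 0:
        n, rem = divmod(n - 1, 26)
        label = chr(ord("A") + rem) + label
    return label


def actual_dimension(sheet_xml: bytes) -> Optional[str]:
    """Compute the used range from the cell references actually present."""
    cols, rows = [], []
    for m in CELL_REF.finditer(sheet_xml):
        cols.append(col_to_num(m.group(1).decode()))
        rows.append(int(m.group(2)))
    if not cols:
        return None  # no cells found; leave the sheet alone
    return f"{num_to_col(min(cols))}{min(rows)}:{num_to_col(max(cols))}{max(rows)}"


def fix_dimension(sheet_xml: bytes) -> bytes:
    """Replace the first <dimension .../> element with the computed range."""
    ref = actual_dimension(sheet_xml)
    if ref is None:
        return sheet_xml
    return DIMENSION.sub(f'<dimension ref="{ref}"/>'.encode(), sheet_xml, count=1)


def repair_xlsx(src, dst, sheet="xl/worksheets/sheet1.xml"):
    """Copy the workbook from src to dst, rewriting the dimension of one sheet."""
    with zipfile.ZipFile(src) as zin, \
            zipfile.ZipFile(dst, "w", zipfile.ZIP_DEFLATED) as zout:
        for item in zin.infolist():
            data = zin.read(item.filename)
            if item.filename == sheet:
                data = fix_dimension(data)
            zout.writestr(item, data)
```

Usage would be something like `repair_xlsx("vendor.xlsx", "vendor_fixed.xlsx")` (both file names are placeholders), run before the file lands in the SharePoint folder the dataflow reads from. Writing to a separate output file keeps the vendor's original intact in case the rewrite misbehaves.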
DocumentFormat.OpenXml was doing the same thing because I didn't add columns correctly to the worksheet in the application code. It's a straightforward fix, but getting our vendor to bother with it most likely won't happen.
Options -> Global -> Data Load -> Never detect column types & headers for unstructured sources?