Python convert csv to xlsx

Question

In this post there is a Python example to convert from csv to xls.

However, my file has more than 65536 rows so xls does not work. If I name the file xlsx it doesnt make a difference. Is there a Python package to convert to xlsx?

starball · Accepted Answer · 2022-12-24 22:09:25Z

111

Here's an example using xlsxwriter:

import os
import glob
import csv
from xlsxwriter.workbook import Workbook


for csvfile in glob.glob(os.path.join('.', '*.csv')):
    workbook = Workbook(csvfile[:-4] + '.xlsx')
    worksheet = workbook.add_worksheet()
    with open(csvfile, 'rt', encoding='utf8') as f:
        reader = csv.reader(f)
        for r, row in enumerate(reader):
            for c, col in enumerate(row):
                worksheet.write(r, c, col)
    workbook.close()

FYI, there is also a package called openpyxl, that can read/write Excel 2007 xlsx/xlsm files.

edited Dec 24, 2022 at 22:09

starball♦

59.6k53 gold badges315 silver badges1k bronze badges

answered Jul 16, 2013 at 18:51

alecxe

476k127 gold badges1.1k silver badges1.2k bronze badges

Sign up to request clarification or add additional context in comments.

14 Comments

Ethan Over a year ago

Thanks for this very helpful code snippet. While using large files, it's better to use 'constant_memory' for controlled memory usage like: workbook = Workbook(csvfile + '.xlsx', {'constant_memory': True}). Ref: xlsxwriter.readthedocs.org/en/latest/working_with_memory.html

MrMobileMan Over a year ago

Nice... However, the xlsx files created are full of all number fields having errors that the fields are stored as text instead of numbers...

MrMobileMan Over a year ago

Found a fix to the numbers as text issue here: stackoverflow.com/questions/24971556/…

Diego Over a year ago

I had to add these lines to make it work with Western European languages import sys reload(sys) sys.setdefaultencoding('latin-1')

pookie Over a year ago

@MrMobileMan It is better to use the xlsxwriter constuctor option strings_to_numbers. For example, workbook = Workbook('output.xlsx',{'strings_to_numbers':True})

|

chfw · Accepted Answer · 2018-08-03 21:46:05Z

49

With my library pyexcel,

 $ pip install pyexcel pyexcel-xlsx

you can do it in one command line:

from pyexcel.cookbook import merge_all_to_a_book
# import pyexcel.ext.xlsx # no longer required if you use pyexcel >= 0.2.2 
import glob


merge_all_to_a_book(glob.glob("your_csv_directory/*.csv"), "output.xlsx")

Each csv will have its own sheet and the name will be their file name.

edited Aug 3, 2018 at 21:46

answered Oct 19, 2014 at 23:42

chfw

4,6122 gold badges32 silver badges32 bronze badges

6 Comments

MrMobileMan Over a year ago

Very nice... Thanks! I up-voted this one. One issue I'm having, however, is that both this and xlswriter create xlsx's full of errors that the text fields are formatted as text instead of numbers...

MrMobileMan Over a year ago

Found the fix to the numbers as text issue here... stackoverflow.com/questions/24971556/…

chfw Over a year ago

If additional formatting is needed, you may not use merge_all_to_a_book but use pyexcel.Sheet, with which you can use format() function to convert float into int first, then use sheet operations to merge them and save as csv.

chfw Over a year ago

with pyexcel-cli package and pyexcel, pyexcel-xlsx, you can do that in command line: $ pyexcel merge your_csv_directory/*.csv out.xlsx

Underoos Over a year ago

How to specify the sheet name if I have the only 1 file to be written to xlsx file?

|

wjandrea · Accepted Answer · 2023-05-04 18:49:48Z

37

Simple two line code solution using pandas

import pandas as pd

read_file = pd.read_csv('File name.csv')
read_file.to_excel('File name.xlsx', index=None, header=True)

edited May 4, 2023 at 18:49

wjandrea

34k10 gold badges69 silver badges105 bronze badges

answered Nov 16, 2019 at 23:11

Bhanu Sinha

1,81615 silver badges10 bronze badges

4 Comments

Muneeb Ahmad Khurram Over a year ago

This probably the more OP way of doing it.

Ricky Levi Over a year ago

thanks ! how can i dump the content to the UI vs to file ?

john k Over a year ago

gave me an eerror. pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 33, saw 2

Niels Over a year ago

Maybe you have semi-colon or something else than comma as delimiter. In that case you can tell read_csv. Example: read_file = pd.read_csv('File name.csv', delimiter=';')

Paolo · Accepted Answer · 2019-12-27 06:15:28Z

27

First install openpyxl:

pip install openpyxl

Then:

from openpyxl import Workbook
import csv


wb = Workbook()
ws = wb.active
with open('test.csv', 'r') as f:
    for row in csv.reader(f):
        ws.append(row)
wb.save('name.xlsx')

edited Dec 27, 2019 at 6:15

Paolo

21.4k21 gold badges78 silver badges124 bronze badges

answered Mar 9, 2017 at 19:07

zhuhuren

3574 silver badges7 bronze badges

1 Comment

viltx Over a year ago

Easy And Support Unicode characters for it also!! It's best and simple for me.

patrickjlong1 · Accepted Answer · 2017-12-29 17:53:10Z

12

Adding an answer that exclusively uses the pandas library to read in a .csv file and save as a .xlsx file. This example makes use of pandas.read_csv (Link to docs) and pandas.dataframe.to_excel (Link to docs).

The fully reproducible example uses numpy to generate random numbers only, and this can be removed if you would like to use your own .csv file.

import pandas as pd
import numpy as np

# Creating a dataframe and saving as test.csv in current directory
df = pd.DataFrame(np.random.randn(100000, 3), columns=list('ABC'))
df.to_csv('test.csv', index = False)

# Reading in test.csv and saving as test.xlsx

df_new = pd.read_csv('test.csv')
writer = pd.ExcelWriter('test.xlsx')
df_new.to_excel(writer, index = False)
writer.save()

edited Dec 29, 2017 at 17:53

answered Dec 29, 2017 at 17:19

patrickjlong1

3,8231 gold badge21 silver badges34 bronze badges

2 Comments

Darren Smith Over a year ago

depends on openpyxl inside pandas

s3dev Over a year ago

Note: This depends on your CSV file being in flat-file format.

Larry W · Accepted Answer · 2020-04-08 20:16:44Z

7

Simple 1-to-1 CSV to XLSX file conversion without enumerating/looping through the rows:

import pyexcel

sheet = pyexcel.get_sheet(file_name="myFile.csv", delimiter=",")
sheet.save_as("myFile.xlsx")

Notes:

I have found that if the file_name is really long (>30 characters excluding path) then the resultant XLSX file will throw an error when Excel tries to load it. Excel will offer to fix the error which it does, but it is frustrating.
There is a great answer previously provided that combines all of the CSV files in a directory into one XLSX workbook, which fits a different use case than just trying to do a 1-to-1 CSV file to XLSX file conversion.

answered Apr 8, 2020 at 20:16

Larry W

1212 silver badges6 bronze badges

2 Comments

Muneeb Ahmad Khurram Over a year ago

simple way of doing it

Yuri Over a year ago

Just a note, this solution requires pyexcel's plugin called pyexcel-xlsx.

mcarton · Accepted Answer · 2018-08-25 20:28:09Z

4

How I do it with openpyxl lib:

import csv
from openpyxl import Workbook

def convert_csv_to_xlsx(self):
    wb = Workbook()
    sheet = wb.active

    CSV_SEPARATOR = "#"

    with open("my_file.csv") as f:
        reader = csv.reader(f)
        for r, row in enumerate(reader):
            for c, col in enumerate(row):
                for idx, val in enumerate(col.split(CSV_SEPARATOR)):
                    cell = sheet.cell(row=r+1, column=idx+1)
                    cell.value = val

    wb.save("my_file.xlsx")

edited Aug 25, 2018 at 20:28

mcarton

30.6k5 gold badges104 silver badges112 bronze badges

answered Aug 17, 2016 at 16:58

Rubycon

18.4k12 gold badges52 silver badges71 bronze badges

Comments

David Ding · Accepted Answer · 2017-05-05 02:23:25Z

1

There is a simple way

import os
import csv
import sys

from openpyxl import Workbook

reload(sys)
sys.setdefaultencoding('utf8')

if __name__ == '__main__':
    workbook = Workbook()
    worksheet = workbook.active
    with open('input.csv', 'r') as f:
        reader = csv.reader(f)
        for r, row in enumerate(reader):
            for c, col in enumerate(row):
                for idx, val in enumerate(col.split(',')):
                    cell = worksheet.cell(row=r+1, column=c+1)
                    cell.value = val
    workbook.save('output.xlsx')

answered May 5, 2017 at 2:23

David Ding

1,7591 gold badge16 silver badges13 bronze badges

Collectives™ on Stack Overflow

Python convert csv to xlsx

8 Answers 8

14 Comments

6 Comments

4 Comments

1 Comment

2 Comments

2 Comments

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

8 Answers 8

14 Comments

6 Comments

4 Comments

1 Comment

2 Comments

2 Comments

Comments

Comments

Linked

Related