website/content/blog/iterativecsv.md at 8048d9f5ec3c4a719fa100f823af6e013e5ae3a0

brozek/website

Fork 0

mirror of https://github.com/Brandon-Rozek/website.git synced 2024-09-19 14:15:13 -04:00

Brandon Rozek 2a9c127365 Re-tagged blog posts

2022-01-02 14:24:29 -05:00

898 B

Raw Blame History

title

date

draft

Standard Library

import csv
with open('/path/to/data.csv', newline='') as csvfile:
   reader = csv.reader(csvfile, delimeter=',')
   for row in reader:
       for column in row:
           do_something()

Pandas

Pandas is slightly different in where you specify a chunksize which is the number of rows per chunk and you get a pandas dataframe with that many rows

import pandas as pd
chunksize = 100
for chunk in pd.read_csv('/path/to/data.csv', chunksize=chunksize):
    do_something(chunk)

898 B Raw Blame History

Standard Library

Pandas

898 B

Raw Blame History