Write result of Python Script to txt file

Question

I have the following script in Python that is meant to find words with two or more vowels in them and output the result to a txt file. The script currently runs, but the output file is empty. I have tried several different methods to no avail, any idea why the output file is blank? I am using the (re) import to treat the input as a regular expression.

#!C:\Python33\python.exe

import re

file = open("Text of Steve Jobs' Commencement address (2005).htm");
output = open('twoVoweledWordList.txt', 'w');

for word in file.read():
   if len(re.findall('[aeiouy]', word)) >= 2:
      match == True;
      while True :
        output.write(word, '\n');

        file.close()
        output.close()

file.read() reads one character at a time and you are looking it up for two vowels. — Himanshu
– Himanshu, Commented Oct 29, 2013 at 1:22
That makes sense! What would be a better way to read each word in at a time? — troy_frommer
– troy_frommer, Commented Oct 29, 2013 at 1:23
match == True is a comparison, not an assignment. Also, in Python you don't need a semicolon on the end of any line. — steveha
– steveha, Commented Oct 29, 2013 at 1:23
There is no need to loop or set a match flag or anything else like that. if len(re.findall('[aeiouy]', word)) >= 2 is already exactly the condition under which we want to write the word to the output file, and we want to write that given word exactly once. — Karl Knechtel
– Karl Knechtel, Commented Oct 29, 2013 at 1:52

steveha · Accepted Answer · 2013-10-29 01:35:38Z

5

You asked for a better way to read a word at a time. Here you go:

with open(input_file_name, "rt") as f:
    for line in f:
        for word in line.split():
            # do something with each word here

Comments:

In general I try to avoid using built-in Python features as variable names. Since file is a built-in in Python 2.x, syntax-coloring text editors will flag it in a different color... might as well just use f for the variable name.
It's best to use the with statement. It is very clear, and in all versions of Python it makes sure your file is properly closed when you are done. (Here it won't matter, but it's really a best practice.)
open() returns an object that you can use in a for loop. You will get one line of input from the file at a time.
line.split() splits the line into words, using any "white space" (spaces, tabs, etc.)

I don't know if you have seen generator functions yet, but you can wrap up the above doubly-nested for loops into a generator function like this:

def words(f):
    for line in f:
        for word in line.split():
            yield word

with open(input_file_name, "rt") as f:
    for word in words(f):
        # do something with word

I like hiding the machinery like this. And if you ever needed to make the word-splitting more complicated, the complex part is nicely separated from the part that actually handles the words.

edited Oct 29, 2013 at 1:35

answered Oct 29, 2013 at 1:24

steveha

77.1k21 gold badges94 silver badges119 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

troy_frommer Over a year ago

I implemented the function as you listed above and my output is the entire html file. I am using if len(re.findall('aeiou]', word)) >= 2 output.write(word + '\n'

Karl Knechtel Over a year ago

Double-check your regex :)

troy_frommer Over a year ago

Found out the issue, it wasn't overwriting the old file :) Less of a code issue more of a dumb human one, thanks for the help!

thefourtheye · Accepted Answer · 2013-10-29 01:31:51Z

1

When you use with statement you dont have to worry about closing the file explicitly. And y is not a vowel, I believe. So, I removed it from my answer.

import re

with open("Input.txt") as inputFile, open("Output.txt", "w") as output:
    for line in inputFile:
        for word in line.split():
            if len(re.findall('[aeiou]', word)) >= 2:
                output.write(word + '\n')

answered Oct 29, 2013 at 1:31

thefourtheye

241k53 gold badges466 silver badges505 bronze badges

Comments

Himanshu · Accepted Answer · 2013-10-29 01:45:29Z

0

While steveha says it nicely, just in case you like for loops better :-

import re

file = open("Text of Steve Jobs' Commencement address (2005).htm")
output = open('twoVoweledWordList.txt', 'w')

for line in file:
    for word in line.split():
       if len(re.findall('[aeiouy]', word)) >= 2:
          output.write(word + '\n')

edited Oct 29, 2013 at 1:45

answered Oct 29, 2013 at 1:31

Himanshu

2,4743 gold badges26 silver badges42 bronze badges

2 Comments

steveha Over a year ago

I recommend re-writing the first for loop as simply: for line in file: The file.readlines() method function will read the entire file into memory, but we only need one line at a time. Simply using the opened file object as an iterator will read one line at a time. This won't matter much for small files, but what if the file was 10 GB of data? Then it would matter a lot.

Himanshu Over a year ago

Thanks. I think it is best done the way in your answer. I just put it as an alternative and because I had written it all up :p

Collectives™ on Stack Overflow

Write result of Python Script to txt file

3 Answers 3

3 Comments

Comments

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

3 Comments

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related