Skip to main content
Post Made Community Wiki by Omar Kooheji
added 346 characters in body
Source Link
vartec
  • 20.9k
  • 1
  • 54
  • 99
  1. run it through spell check, it will find very few proper English words;
  2. dot followed by word w/o space, also not proper English punctuation;
  3. something(); just cannot be plain English;

First, run it through spell check, it will find very few proper English words, however there should be lot of words that spellchecker will suggest to split.

Then there are punctuation/special characters not typical for plain English, typical for code:

  • something(); just cannot be plain English;
  • $something where something is not all numeric;
  • -> between words w/o spaces;
  • . between words w/o space;

Of course to have it working well, you might want to have Bayesian classifier built on top of these characteristics.

  1. run it through spell check, it will find very few proper English words;
  2. dot followed by word w/o space, also not proper English punctuation;
  3. something(); just cannot be plain English;

First, run it through spell check, it will find very few proper English words, however there should be lot of words that spellchecker will suggest to split.

Then there are punctuation/special characters not typical for plain English, typical for code:

  • something(); just cannot be plain English;
  • $something where something is not all numeric;
  • -> between words w/o spaces;
  • . between words w/o space;

Of course to have it working well, you might want to have Bayesian classifier built on top of these characteristics.

Source Link
vartec
  • 20.9k
  • 1
  • 54
  • 99

  1. run it through spell check, it will find very few proper English words;
  2. dot followed by word w/o space, also not proper English punctuation;
  3. something(); just cannot be plain English;