Using this as my corpus http://americanbookreview.org/100bestlines.asp 5 minutes of python hackery gives me this advice. You should start your novel with "It" (11%) "I" (11%) or "The" (8%) or maybe "In" (6%) or "Once" (4%). For an outside bet, use "He" or "You" (3% each). If you want to take a punt then go with "When" "A" "They" or "If" 2% each. That covers more than 50% of the data and the rest is experimental error. Nice to get some stats on the problem.
Comments 2
Reply
You should start your novel with "It" (11%) "I" (11%) or "The" (8%) or maybe "In" (6%) or "Once" (4%). For an outside bet, use "He" or "You" (3% each). If you want to take a punt then go with "When" "A" "They" or "If" 2% each. That covers more than 50% of the data and the rest is experimental error. Nice to get some stats on the problem.
"So" had no occurrences.
Reply
Leave a comment