One problem I find with the FQG = "the" hypothesis is the second line, which ends up being:
And I don't see how that can make sense.
However, one of the common group of four symbols is FQOA. FQOD, and FQOQ also occur. Assuming FQG=the, I'd go with that/than (i.e. O = "a", A = "t", D = "n"), but I can't find an explanation for "thoh": there's no word with this string of letters, and all of "...t hoh...", "...th oh...", and "...tho h..." make little sense. Same thing if you suppose this/thin or thus/thug.
Based on the outstanding frequency of F, I've also tried to assume it was "e", and since QQ also occurs, I've tried assuming Q was "s" based on the coupled frequencies of "ss" and "es". It's not very promising so far.
There's certainly something to be done with the fact that only F, Q, A, O, T, and Z occur in doubles (F and Q 4 times each; the others only once) and the frequencies of double letters in English.
As a side note: In this text, there is indeed no triple of the same symbol, but several in doubles. This strenghtens the hypothesis that it is a simple cypher for English, as a more complex cypher would produce a few triples, and a monoalphabetic cypher of Arabic would not contain symbols in double.
> Date: Tue, 15 Jun 2010 16:24:29 -0700
> From: [log in to unmask]
> Subject: Re: Why do so many crackpots go for "crypto"?
> To: [log in to unmask]
> On Tue, Jun 15, 2010 at 3:07 PM, Patrick Dunn <[log in to unmask]> wrote:
> > If the plaintext is in Arabic, I notice that FQ is commonly occurs at
> > the beginnings of lines. Perhaps this is the definite article?
> > That's the limit of my Arabic, though.
> One characteristic of ciphers that are difficult to break is that they
> give the appearance of true randomness, having disguised the
> statistical regularities of language. This text does not show uniform,
> or even nearly uniform distribution of letters and letter pairs. Those
> distributions are far too language-like, meaning the statistical
> properties of the language are still present in the cipher text.
> Here are letter pair frequencies (exclusive of pairs that cross line boundaries)
> 55 FQ
> 20 JF
> 13 QG
> 12 VF, IF
> 11 QO, FI, EJ
> 10 AJ
> 9 QA, 9 EF, 9 AI
> 7 SV, 7 QF, 7 OA
> 6 TF, 6 GI, 6 AN
> 5 TJ, QY, NE, JV, JG, GF, FV, FG, FE, DA, AF, AE
> 4 YA, WQ, QS, QQ, QJ, OF, JB, IA, GA, FF, FA, BJ, AW, AV, AT
> 3 WI, WE, VZ, VW, VG, VE, UA, QU, QE, QB, PA, OP, NA, MT, GJ, FZ, FO,
> EV, EA, BA, AO
> 2 ZQ, XA, WA, VT, VQ, VA, UJ, TA, SE, QP, QN, QM, QK, QI, QC, PQ, PG,
> PF, OW, OV, OD, NJ, MQ, MO, LE, KS, IW, IV, IT, IJ, HZ, HQ, GW, GS,
> GP, GO, FT, FM, FB, ET, EP, EN, DE, CG, AS, AG
> 1 ZZ, ZW, ZV, ZP, ZF, ZE, ZA, YN, YF, XN, XJ, XE, WX, WT, WS, WJ, WF,
> VR, VP, VL, VJ, VI, VH, VD, TW, TV, TT, TS, TO, TD, SO, SH, SG, SF,
> SA, RV, RG, QV, QK, QD, PO, PJ, OX, OT, OS, OQ, OO, OH, OG, OE, OC,
> NX, NQ, NO, NM, NG, NF, MG, MF, LF, KV, KQ, KE, JX, JW, JO, JN, JM,
> JK, JD, JC, JA, IX, IU, IR, IN, IH, IG, GY, GV, GT, GQ, GM, GL, GD,
> GC, FW, FN, FL, FK, FD, EK, EC, DO, DN, DG, DF, CV, CI, CF, CE, BI,
> BF, AU, AQ, AM, AK, AD, AA
> If the plaintext were in English then 'FQ' would almost certainly be
> "th". Three-letter combinations starting with FQ are:
> 12 FQG C, FQG D, FQG J, FQG L, FQG M, FQG O, FQG P, FQG V, FQG W
> 8 FQO A, FQO D, FQO Q
> 5 FQY A, FQY N
> 4 FQS A, FQS V
> 4 FQQ J, FQQ U
> 4 FQF G, FQF I, FQF L, FQF Q
> 3 FQE F, FQE P
> 3 FQB A
> 2 FQP A, FQP Q
> 2 FQA J, FQA V
> 1 FQV L
> 1 FQN M
> 1 FQK E
> 1 FQD A
> 1 FQC I
> 1 FQC G
> 1 FQA
> with "FQG" being the most frequent, and with the most variety of
> letters following, so, again assuming English plaintext, FQG = "the"
> would be a good starting point.
> Of course, the author may have been diabolically clever and written
> English from right to left, in which case <sarcasm>we will never be
> able to figure out his fiendish cipher.</sarcasm>
MSN Rencontres: Faites une rencontre spéciale cette année.