Ok let's first look at a screen shot. This is a screen shot of a text file we call it VCF file. How many rows it might have? Maybe 100,000 rows of things like this:

I am totally new and novice to MongoDB so I thought of a schema like this:

So for example notice REF in that text file is a Key/Value in my schema. But like I said it might have 200,000 rows... So:
- Are Arrays still a good thing I can use? storing 200,000 members in that array?
- How powerful I can query on it? so in the text file we have rows, for example that #CHROM20 in POS of 14370 has a REF of "G" and ALT of "A" ... so with my Schema can we find and return it? Let's say we say search for patients that have "G" in their REF field, so are MongoDB queries powerful enough to search and return such a result?
- Is it a bad schema? Do you have better recommendations/advice?
- Any sample query could you give for my qquesry in question will be so helpful to give me some ideas..
Let's say we say search for patients that have "G" in their REF field: Does ref:[TCG,TA] count or only ref:[A,T,ATC,G] ?