Chris Miller

Bioinformatics Grad student at Baylor College of Medicine. My online home is at http://www.chrisamiller.com/
RT @jcenters: People act like @SenTedCruz is an idiot. He graduated cum laude from Princeton and magna cum laude from Harvard. He’s just an asshole.
RT @randal_olson: Helpful guide for when you should use 3D pie charts in #dataviz. https://twitter.com/randal_...
Is it too soon to be experiencing seasonal affective disorder? #leavingworkinthedark #ugh
Jealous of the people who declare email bankruptcy and just delete it all. It's been a long time since #inboxzero
At the point in writing where I start saving the doc to several locations, lest one of them crash before I can submit. Must mean I'm close.
Comment: C: sciClone error for two sample - https://www.biostars.org/p...
SciClone can take care of removing those regions, if all your inputs are set up correctly. No need to filter yourself. - Chris Miller
RT @chriskingstl: Ballot amendments: public policy so bad you can't even pay a majority of state legislators to vote for it.
Comment: C: Are we seeing a bioinformatician shortage? - https://www.biostars.org/p...
Perhaps a more optimistic slant would be to classify ourselves as biologists, but ones who specialize in computational assays. By that metric, we get higher salaries than other academics, have less trouble finding jobs, etc. :-) - Chris Miller
Comment: C: sciClone error for two sample - https://www.biostars.org/p...
1) Regarding the mutation lists, If you call the following mutations: Tumor 1: 1 111 10 10 50.0 2 222 9 9 50.0 Tumor 2: 1 999 8 8 50.0 2 888 7 7 50.0 You then need to merge the calls and pull readcounts for every site in both samples, so your files will look like: Tumor 1: 1 111 10 10 50.0 1 999 6 6 50.0 2 222 9 9 50.0 2 888 5 5 50.0 Tumor 2: 1 111 4 4 50.0 1 999 8 8 50.0 2 222 3 3 50.0 2 888 7 7 50.0   2) Read the sciClone paper for a full explanation of why we only consider copy-number neutral sites for clustering: http://www.ploscompbiol.org/article... - Chris Miller
Answer: A: sciClone error for two sample - https://www.biostars.org/p...
Possible problems:  - Do you have readcounts for each site in both tumors? The first two columns of the lists of variants that you feed in should be identical. - Is your tumor polyploid, or mostly CN altered? If so, no points will be usable. - Is your CN data formatted incorrectly? By default, sciClone expects absolute copy number values (CN 2 = neutral, CN 3 = 1 copy amplified, etc). You can feed it log2 values by passing the appropriate flag. - Is your data low-coverage, such that no points are exceeding the minimum depth threshold? - Chris Miller
Comment: C: sciClone input, output and parameters - https://www.biostars.org/p...
Hi Lisa. I'd need more info to help you figure out why you're getting this erro. I'd suggest opening up a new Biostar question, and posting at least the header of your input files there so that I have some information to use for troubleshooting. Feel free to shoot me an email when that question is up ([email protected]) and I'll take a look. - Chris Miller
Job: Cancer Genomics/Genomic Medicine - The Genome Institute at WUSTL - https://www.biostars.org/p...
I have an opening for a position in my group at the Genome Institute at Washington University in St Louis that mixes research into cancer genomics and applying that knowledge to genomic medicine. Our group is a collection of mostly-PhD holding bioinformatics experts, who apply computational tools to help understand the origins and progression of cancer. We specialize on translating large-scale sequence data (genomic DNA, RNA-seq, and bisulfite-seq, among others) into insight about how to better detect cancer, predict its response to therapy, and treat patients. You will spend approximately 50% of your time analyzing and interpreting cases from our clinical sequencing group, in an environment where your work may directly influence patient care. The other half of your time will be spent on research projects, pairing with some fantastic oncologists to probe the genomics of tumorigenesis, relapse, and clonal evolution in response to therapy. Ideally, you'll have a strong biological... - Chris Miller
Comment: C: sciClone input, output and parameters - https://www.biostars.org/p...
Yep - just merge them into one file and use that as your "regions to exclude" file. - Chris Miller
Answer: A: TCGA/Broad Institute CNV Files Segment Mean - https://www.biostars.org/p...
Those are the log2 ratio of the tumor intensity to the normal intensity. To convert to an absolute cn, use:  (2^seg_mean)*2 - Chris Miller
Answer: A: sciClone input, output and parameters - https://www.biostars.org/p...
If you look at the help (with ?sciClone), you'll find many of the answers you're looking for. 1. My copy number data used for input is log-ratio data. It seems though as sciClone interpreted it as absolute copy number data, since only cn=2 were used (and according to the paper, sciClone only looks at VAFs in regions of normal cn). Is it possible to make the program use the log-ratio data? When calling the sciClone() function, set cnCallsAreLog2=TRUE 2. The program seems to have filtered away mutations with depth<100. Is it possible to decrease this limit? SciClone works best with deep readcounts, as they decrease the uncertainty associated with each VAF. That said, if you want to decrease it, when calling the sciClone() function, change the minimumDepth parameter. 3. Finally, the output contains one cluster. However, visually there are 3 subclusters in that cluster. I understand they are too close to each other to be considered separate clusters. Is it possible to make the program... - Chris Miller
I can't take any software tool seriously that spits out windows-style carriage returns in it's output
RT @neilfws: Bioinformaticians have been writing for years about how data preparation is >= 80% of the job; good to see "big data science" catching up :)
RT @nextSTL: In Ferguson, Black Town, White Power http://www.nytimes.com/2014... an op-ed by @JeffSmithMO
RT @nytimesbits: For ‘Big Data’ Scientists, Hurdle to Insights Is ‘Janitor Work’ http://www.nytimes.com/2014...
I need some forms/permits for my new house, but can't get them because of the DDOS attack against STL county website. Screw you, Anonymous.
RT @ivanoransky: Authors retract highly cited XMRV-prostate cancer link paper from PNAS http://retractionwatch.com/2014...
package available here: https://t.co/mbflgUhRod and code and data for reproducing the figures/analyses is here: https://t.co/hIQL0fqgCg
Our paper on inferring the clonal architecture of heterogeneous tumors is live today: http://www.ploscompbiol.org/article...
Anyone hear back about poster/talk assignments for Genome Informatics? They were supposed to be out Tuesday and no word... #gi2014
RT @pkedrosky: Latest population-adjusted “peak driving” graph from Doug Short. We’re back to Dec 1994. https://twitter.com/pkedros...
SNL has ruined 'Don't Fear the Reaper' for me. All I hear is cowbell.
Sounds about right: On Tuesday, vote No, No, No, No and No. http://www.stltoday.com/news...
Four days away from work, 220 emails in the inbox. Let's do this.
"I can see clearly now the rain is gone. I can see all popsicles in my way". -- My two year old sings Motown, volume one
"Every labor economist who has studied the pipeline for the biomedical workforce has proclaimed it to be broken" http://www.pnas.org/content...
RT @GraveleyLab: Please don't put the impact factor of the journal your papers are in on your CV!
RT @kerencarss: This is an important article. WGS has better coverage uniformity than WES, and less bias in variant calling. http://www.biomedcentral.com/1471-21...