Gene Length Normalization


#1

Hi

When parsing output from DIAMOND, is it possible to correct for the gene length bias. In essence, calculating RPKs for every gene?

Best,
Aditya


#2

Hi

I was exploring a few ways to do this, given the current set of export options in MEGAN6. However, it’s quite long-winding. The export as CSV option “SEED Name to Length” seems to export the sum of the read lengths. I would assume its quite straight forward to add an option for gene length normalization as the subject length for the best hit is available in the blast output?
Requesting if this could be implemented

Best,
Aditya


#3

Hi Aditya,

good idea… I have added this feature to MEGAN and will release the update later this week…
Best wishes
daniel


#4

Hi Daniel

That would be a life saver! Many thanks

Best,
Aditya