I am trying to build a malt index for all NCBI bacterial genomes (~145000 files of > 500 GB). Is this possible with Malt? I got a Java message out of memory, but I had allocated 900 GB of RAM when setting it up on our server. It started to create a massive index file before failing, so I do not think that my server would have the capacity even if it were possible. If not possible, can you recommend a set of bacterial genome references that would be good?
Along this same line, is it possible to build a malt index of NCBI nr or nt? Is this too much? Is Malt just designed for subsets of NCBI db?
Sorry for all the questions. I am keen to get started using Malt.