Staggering token spending is forcing Indian firms to explore ways to rein in AI cost. This is where open-source models, small language models & inference layer startups step in, writes Swathi Moorthy.