SoftBank's Infrinia AI Cloud OS for GPU cloud services

CloudTech is part of the TechForge Publications seriesView AllAI NewsDeveloperIoT NewsMarketing TechTechHQTech Wire AsiaTelecomsView AllAI NewsDeveloperIoT NewsMarketing TechTechHQTech Wire AsiaTelecomsTechForge SearchNewsCategoriesCloud in ActionCloud MigrationCloud ROI & CostInternal Change ManagementMissteps & LessonsSME & Startup CloudEditorial DeskAnnouncements & AnalysisForecasts & TrendsMigrations: Behind the ScenesTechEx EventsFeaturesInterviewsPodcastsSponsored ContentVideosWebinarsFuture of CloudAI & CloudCloud EthicsEdge & Distributed CloudOpen CloudQuantum & CloudServerless ArchitectureSustainable CloudIndustry PerspectivesEducation & ResearchFinanceHealthcare & Life SciencesLegal & HRMedia, Gaming & CreativePublic SectorRetail & ConsumerMarket IntelligenceCloud StartupsEarnings & Market ShareEvent CoverageMergers & AcquisitionsVendor Roadmaps & LeadershipSecurity, Privacy & TrustCloud CybersecurityCyber Security & Cloud ExpoEncryption & Data PrivacyGovernance, Risk & ComplianceIdentity & AccessStrategy & Decision-MakingChoosing a Cloud StrategyFinOps & BudgetsLock-In & ExitMulti- & Hybrid CloudProcurement & ContractsSkills & HiringTechnology StackBig VendorsContainers & KubernetesDatabases & Data PlatformsInfrastructure as CodeObservability & MonitoringXaaS ModelsEventsResourcesOn-demand WebinarsExclusive VideosPodcastsAll ResourcesMoreAdvertiseAbout UsContact Us SearchNewsCategoriesCloud in ActionCloud MigrationCloud ROI & CostInternal Change ManagementMissteps & LessonsSME & Startup CloudEditorial DeskAnnouncements & AnalysisForecasts & TrendsMigrations: Behind the ScenesTechEx EventsFeaturesInterviewsPodcastsSponsored ContentVideosWebinarsFuture of CloudAI & CloudCloud EthicsEdge & Distributed CloudOpen CloudQuantum & CloudServerless ArchitectureSustainable CloudIndustry PerspectivesEducation & ResearchFinanceHealthcare & Life SciencesLegal & HRMedia, Gaming & CreativePublic SectorRetail & ConsumerMarket IntelligenceCloud StartupsEarnings & Market ShareEvent CoverageMergers & AcquisitionsVendor Roadmaps & LeadershipSecurity, Privacy & TrustCloud CybersecurityCyber Security & Cloud ExpoEncryption & Data PrivacyGovernance, Risk & ComplianceIdentity & AccessStrategy & Decision-MakingChoosing a Cloud StrategyFinOps & BudgetsLock-In & ExitMulti- & Hybrid CloudProcurement & ContractsSkills & HiringTechnology StackBig VendorsContainers & KubernetesDatabases & Data PlatformsInfrastructure as CodeObservability & MonitoringXaaS ModelsEventsResourcesOn-demand WebinarsExclusive VideosPodcastsAll ResourcesMoreAdvertiseAbout UsContact Us Subscribe Subscribe SearchNewsCategoriesCloud in ActionCloud MigrationCloud ROI & CostInternal Change ManagementMissteps & LessonsSME & Startup CloudEditorial DeskAnnouncements & AnalysisForecasts & TrendsMigrations: Behind the ScenesTechEx EventsFeaturesInterviewsPodcastsSponsored ContentVideosWebinarsFuture of CloudAI & CloudCloud EthicsEdge & Distributed CloudOpen CloudQuantum & CloudServerless ArchitectureSustainable CloudIndustry PerspectivesEducation & ResearchFinanceHealthcare & Life SciencesLegal & HRMedia, Gaming & CreativePublic SectorRetail & ConsumerMarket IntelligenceCloud StartupsEarnings & Market ShareEvent CoverageMergers & AcquisitionsVendor Roadmaps & LeadershipSecurity, Privacy & TrustCloud CybersecurityCyber Security & Cloud ExpoEncryption & Data PrivacyGovernance, Risk & ComplianceIdentity & AccessStrategy & Decision-MakingChoosing a Cloud StrategyFinOps & BudgetsLock-In & ExitMulti- & Hybrid CloudProcurement & ContractsSkills & HiringTechnology StackBig VendorsContainers & KubernetesDatabases & Data PlatformsInfrastructure as CodeObservability & MonitoringXaaS ModelsEventsResourcesOn-demand WebinarsExclusive VideosPodcastsAll ResourcesMoreAdvertiseAbout UsContact Us Hamburger Toggle Menu AI & Cloud, Announcements & Analysis, Serverless ArchitectureSoftBank’s Infrinia AI Cloud OS for GPU cloud servicesDavid Thomas29th January 2026 Share this story: Tags:infriniak8sKubernetessoftbankCategories::AI & CloudAnnouncements & AnalysisServerless ArchitectureJapanese multinational investment holding company, SoftBank, has launched Infrinia AI Cloud OS, a software stack custom-designed for AI data centres.Designed by the company’s Infrinia team, Infrinia AI Cloud OS lets data centre operators deliver Kubernetes-as-a-service (KaaS) in multi-tenant settings, and offer inference-as-a-service (Inf-aaS).Therefore, customers can access LLMs via simple APIs that can be added directly into an operator’s existing GPU cloud offerings.Infrinia Cloud OS meets growing global demandsThe software stack is expected to reduce total cost of ownership (TCO) and streamline day-to-day complexities, particularly when compared to options developed internally and custom-made stacks.

Ultimately, Infrinia Cloud OS promises to accelerate GPU cloud services deployments, simultaneously supporting each stage of the AI lifecycle, from training models to real-time use.Initially, SoftBank plans to incorporate Infrinia Cloud OS into its existing GPU cloud offerings before deploying the software stack globally to overseas data centres and cloud platforms in the future.Demand for GPU-powered AI has been increasing rapidly in many industries, from science and robotics to generative AI.As the complex needs of users also grows, it places demand on GPU cloud service providers.Some users require fully managed systems with “abstracted GPU bare-metal servers” while others need affordable AI inference without having to rely on GPU management directly.Others seek more advanced setups where AI model training is centralised and inference is implemented at the edge.Infrinia AI Cloud OS has been designed to meet these challenges, maximising GPU performance and easing management and deployment of GPU cloud services.Infrinia Cloud OS’ abilitiesWith its KaaS features, SoftBank’s latest software stack is able to automate every layer of the underlying infrastructure, from low-level server settings through to storage, networking, and Kubernetes itself.It can also reconfigure hardware connections and memory as and when required, letting GPU clusters to be produced, adjusted, or removed quickly to suit different AI workloads.

Automated node allocation, that is based on how close GPUs are connected and NVIDIA NVLink domains, helps reduce delays and improves GPU-to-GPU bandwidth for larger scale, distributed workloads.Infrinia’s Inf-aaS component has been designed so users can implement inference workloads easily, enabling faster and more scalable access AI model inference through managed services.By simplifying operational complexities and decreasing the TCO, Infrinia AI Cloud OS is positioned to accelerate the adoption of GPU-based AI infrastructure in different sectors worldwide. Want to learn more about Cloud Computing from industry leaders? Check out Cyber Security & Cloud Expo taking place in Amsterdam, California, and London.The comprehensive event is part of TechEx and co-located with other leading technology events.

Click here for more information.CloudTech News is powered by TechForge Media.Explore other upcoming enterprise technology events and webinars here.About the Author David ThomasTechnology Journalist David is an experienced content writer with over five years in the technology field, including a previous role as content team leader.He has a keen interest in artificial intelligence, robotics, and nanotechnology.

David researches and stays current with the latest tech developments through forums, podcasts, blogs, and more.Beyond his specialisations, he has explored niches including lifestyle, sports, entertainment, and his first love, music.Related Data centres gain their own insurance bracket as business risk increases29th January 2026 Nationwide is deepening its use of cloud services with AWS29th January 2026 How Mercedes F1 uses cloud for real-time decision-making27th January 2026 ByteDance steps up its push into enterprise cloud services21st January 2026 Data centres gain their own insurance bracket as business risk increases29th January 2026 Nationwide is deepening its use of cloud services with AWS29th January 2026 How Mercedes F1 uses cloud for real-time decision-making27th January 2026 ByteDance steps up its push into enterprise cloud services21st January 2026 Join our CommunitySubscribe now to get all our premium content and latest tech news delivered straight to your inbox Click here Popular Cloud ROI & Cost, Interviews, Sponsored Content, Sustainable CloudRipple effect: Xylem’s sustainable water solutions for Europe’s data centres 20359 view(s)Cloud Computing, XaaS ModelsConcern over cloud storage security remains says Spiceworks – but good news for OneDrive 12583 view(s)Big Vendors, Cloud Computing, Market IntelligenceOracle Cloud denies breach as hacker offers 6 million records for sale 5513 view(s)Big Vendors, Cloud Computing, Market Intelligence5 of the best: cloud technology training platforms 5384 view(s)Cloud ROI & Cost, Interviews, Sponsored Content, Sustainable CloudRipple effect: Xylem’s sustainable water solutions for Europe’s data centres 20359 view(s)Cloud Computing, XaaS ModelsConcern over cloud storage security remains says Spiceworks – but good news for OneDrive 12583 view(s)Big Vendors, Cloud Computing, Market IntelligenceOracle Cloud denies breach as hacker offers 6 million records for sale 5513 view(s)Big Vendors, Cloud Computing, Market Intelligence5 of the best: cloud technology training platforms 5384 view(s) See all Latest View All Latest Cloud in Action27th January 2026How Mercedes F1 uses cloud for real-time decision-making Cloud Computing21st January 2026ByteDance steps up its push into enterprise cloud services Sponsored Content21st January 2026Best cross-tenant migration tool: Securing enterprise cloud transitions Cloud in Action27th January 2026How Mercedes F1 uses cloud for real-time decision-making Cloud Computing21st January 2026ByteDance steps up its push into enterprise cloud services Sponsored Content21st January 2026Best cross-tenant migration tool: Securing enterprise cloud transitions SubscribeAll our premium content and latest tech news delivered straight to your inbox Subscribe ExploreAbout UsContact UsNewsletterPrivacy PolicyCookie PolicyAbout UsContact UsNewsletterPrivacy PolicyCookie PolicyReach Our AudienceAdvertisePost a Press ReleaseContact UsAdvertisePost a Press ReleaseContact UsCategoriesCloud in ActionEditorial DeskFeaturesFuture of CloudIndustry PerspectivesMarket IntelligenceSecurity, Privacy & TrustTechnology StackStrategy & Decision-MakingAll CategoriesCloud in ActionEditorial DeskFeaturesFuture of CloudIndustry PerspectivesMarket IntelligenceSecurity, Privacy & TrustTechnology StackStrategy & Decision-MakingAll CategoriesOther PublicationsExplore AllAI NewsDeveloperIoT NewsMarketing TechTechHQTech Wire AsiaTelecomsExplore AllAI NewsDeveloperIoT NewsMarketing TechTechHQTech Wire AsiaTelecomsCloudTech News is part of TechForge  SubscribeAll our premium content and latest tech news delivered straight to your inbox

Read More
Related Posts