Datagrom AI News Logo
Feb 6, 2025
Datagrom AI News Cloud

GPT-4o generated super-summaries of last 90 days of Snowflake and Databricks news
Fine-tuned GPT-3.5 model generated super-summary of last 90 days

Top 20 stories
From the last 90 days
0 stories filtered / 20 added
Updated daily
From February 6, 2025
0 stories filtered / 0 added

SoftBank in talks to invest as much as $25B in OpenAI, report says

SoftBank in talks to invest as much as $25B in OpenAI, report says

January 30, 2025: SoftBank Eyes Major Investment in OpenAI - SoftBank is in talks to invest up to $25 billion in OpenAI, which could make it the largest investor. This move aligns with the $100 billion Stargate data center project, a joint venture where both OpenAI and SoftBank will contribute significantly.

Amid competitive concerns, OpenAI accuses DeepSeek of illegally using its models after DeepSeek released a cost-efficient model. The SoftBank investment could lessen OpenAI's dependence on Microsoft, as OpenAI explores shifting to a for-profit model to boost fundraising efforts.

Ex-Google, Apple engineers launch unconditionally open source Oumi AI platform that could help to build the next DeepSeek

Ex-Google, Apple engineers launch unconditionally open source Oumi AI platform that could help to build the next DeepSeek

January 29, 2025: Oumi Launches Fully Open AI Platform - Ex-Google and Apple engineers have launched Oumi, an open-source AI platform that offers transparency by providing model code, weights, and training data. Supported by top universities, Oumi eliminates project silos and streamlines AI model building with integrated tools, supporting models of various sizes and advanced training techniques.

Oumi provides a scalable environment from local to cloud-based infrastructures and uses distributed computing to lower costs, enhancing collaborative AI research and enterprise deployment. Unlike large investments by companies like OpenAI, Oumi aims to make AI more accessible and cost-effective.

Microsoft probing whether DeepSeek improperly used OpenAI APIs

Microsoft probing whether DeepSeek improperly used OpenAI APIs

January 29, 2025: Microsoft Investigates DeepSeeks OpenAI API Misuse - Microsoft is investigating Chinese company DeepSeek for allegedly using OpenAI's API to train its own AI models, potentially violating OpenAI's terms of service. These terms prohibit using API output to develop competing models and disallow automated data extraction. DeepSeek may have used distillation techniques to extract OpenAI’s model knowledge and bypassed rate limits to gather data at scale.

As OpenAI's major shareholder, Microsoft has notified OpenAI of these activities, which could result in significant legal consequences.

Dario Amodei challenges DeepSeek’s $6 million AI narrative: What Anthropic thinks about China’s latest AI move

Dario Amodei challenges DeepSeek’s $6 million AI narrative: What Anthropic thinks about China’s latest AI move

January 29, 2025: Amodei Debunks DeepSeeks AI Cost Narrative - Dario Amodei of Anthropic challenges the narrative around DeepSeeks' $6 million AI model, presenting a more nuanced picture. He reveals that DeepSeeks' overall investment, including $1 billion in computing hardware, rivals that of U.S. AI companies. Amodei emphasizes the true innovation was in DeepSeek-V3, rather than R1.

He predicts that the current parity in AI development costs is temporary, with future competition favoring companies with significant resources. Amodei's analysis exposes the complexity behind AI cost and investment, countering the simplified narrative that shocked the markets.

OpenAI finds DeepSeek used its data to train R1 reasoning model

OpenAI finds DeepSeek used its data to train R1 reasoning model

January 29, 2025: OpenAI Accuses DeepSeek of Data Breach - OpenAI suspects that its data was used to train DeepSeek's R1 reasoning model, violating OpenAI's terms of service. Microsoft identified the potential misuse, leading OpenAI to block the involved users' API access. DeepSeek reportedly used distillation to transfer knowledge, potentially reducing training costs. The R1 model primarily employs a mixture of experts approach and was mainly trained with reinforcement learning to improve reasoning skills.

Concerns about R1's efficiency led to a 17% drop in Nvidia's stock shares. In response to these developments, OpenAI emphasizes its collaboration with the U.S. government to protect its AI technologies from competitors.

Microsoft brings a DeepSeek model to its cloud

Microsoft brings a DeepSeek model to its cloud

January 29, 2025: Microsoft Integrates Controversial DeepSeek Model into Azure - Microsoft now provides DeepSeeks R1 model on its Azure AI Foundry, despite IP violation concerns with OpenAI. Although R1 has faced criticism for inaccuracy and censorship, Microsoft promises thorough security and safety checks. Customers will soon access lighter R1 versions on Copilot+ PCs.

This decision is notable as Microsoft investigates DeepSeeks potential misuse of OpenAIs API. Despite these uncertainties, Microsofts interest in R1 highlights its significant appeal in the AI field.

David Sacks claims there’s ‘substantial evidence’ that DeepSeek used OpenAI’s models to train its own

David Sacks claims there’s ‘substantial evidence’ that DeepSeek used OpenAI’s models to train its own

January 28, 2025: DeepSeek Accused of Exploiting OpenAI Models - David Sacks, the U.S. AI and crypto czar, alleges substantial evidence that China's DeepSeek used outputs from OpenAI models to train its AI, likening it to theft. Although Sacks hasn't disclosed the evidence, he notes OpenAI's dissatisfaction.

DeepSeek's popular AI models and apps are under U.S. government scrutiny. The National Security Council is examining their implications, and the U.S. Navy has banned their use due to security and ethical concerns.

OpenAI launches ChatGPT plan for U.S. government agencies

OpenAI launches ChatGPT plan for U.S. government agencies

January 28, 2025: OpenAI Launches ChatGPT for U.S. Government Use - OpenAI has launched ChatGPT Gov, a specialized AI chatbot for U.S. government agencies. This platform includes features from ChatGPT Enterprise and allows the deployment of models on Microsoft Azure clouds, ensuring enhanced security, privacy, and compliance.

It aims to simplify the management of non-public sensitive data. Since 2024, over 90,000 users from 3,500 government agencies have used ChatGPT for daily operations, exchanging more than 18 million messages.

Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model

Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model

January 28, 2025: Hugging Face Develops Open AI Reasoning Model - Hugging Face researchers aim to develop an open-source alternative to DeepSeeks AI reasoning model, focusing on increasing transparency and accessibility within the AI community. Their goal is to make powerful AI models more widely available, fostering innovation and collaboration in advanced AI reasoning capabilities.

This initiative reflects Hugging Face's commitment to openness and sharing knowledge in AI research. By creating accessible models, they hope to drive forward the capabilities of AI and encourage wider participation in AI development and research.

Viral AI company DeepSeek releases new image model family

Viral AI company DeepSeek releases new image model family

January 27, 2025: DeepSeeks Janus-Pro Models Outperform DALL-E 3 - Viral AI company DeepSeek has unveiled Janus-Pro, a new family of multimodal AI models available on Hugging Face. These models, ranging from 1 to 7 billion parameters, reportedly surpass OpenAI's DALL-E 3 and other models like Stable Diffusion XL. Despite a resolution limit of 384 x 384 for analysis, Janus-Pro's performance is notable for its compact size and versatility, marking it as a frontrunner for next-gen AI models.

Funded by High-Flyer Capital, DeepSeek is challenging the AI landscape, raising questions about the future of AI leadership and chip demand. The introduction of Janus-Pro brings attention to the evolving dynamics and potential shifts in the market, positioning DeepSeek as a significant player in advancing AI technology.

Meta AI can now use your Facebook and Instagram data to personalize its responses

Meta AI can now use your Facebook and Instagram data to personalize its responses

January 27, 2025: Meta AI Personalizes with Your Social Data - Meta AI personalizes responses using data from Facebook and Instagram, remembering user preferences and interactions. Available on Facebook, Messenger, and Instagram in the U.S. and Canada, this feature offers tailored recommendations like dietary advice and creative content suggestions. Meta CEO Mark Zuckerberg highlighted its potential for personalized storytelling.

The update raises privacy concerns, as there is no opt-out option, prompting skepticism about Meta's data practices. Despite these concerns, Meta emphasizes the customization benefits of the feature.

OpenAI-backed 1X acquires Kind Humanoid

OpenAI-backed 1X acquires Kind Humanoid

January 27, 2025: 1X Acquires Kind Humanoid to Boost Robotics Vision - 1X has acquired Kind Humanoid, a promising Norwegian startup known for partnering with designer Yves Bhar to create commercial robots. This acquisition aligns with 1X's mission of creating abundant labor through intelligent humanoids, aided by OpenAI's backing and past funding successes. The move marks a significant development in the humanoid robotics field.

The partnership will support 1X's expansion in the Bay Area and highlights the merge as a step towards integrating large language models with humanoid robotics. Financial details of the acquisition remain undisclosed.

Snowflake claims breakthrough can cut AI inferencing times by more than 50%

Snowflake claims breakthrough can cut AI inferencing times by more than 50%

January 16, 2025: Snowflake Unveils SwiftKV for Faster AI Inference - Snowflake Inc. has unveiled SwiftKV, an optimization technique that significantly reduces AI inferencing times and costs for large language models. By recycling hidden states, SwiftKV improves inference throughput by 50% and reduces costs for Llama models by up to 75%.

This technique enhances real-time applications by lowering memory usage and computational overhead, benefiting tasks like chatbots and translation. SwiftKV maintains high accuracy with minimal quality loss, offering substantial performance boosts for unstructured text processing and latency-sensitive tasks within Snowflake's Data Cloud platform.