Alex pfp

Alex

@alexdyor

246 Following
41 Followers


Alex pfp
Alex
@alexdyor
MiniMax is holding a week of releases, and two powerful models are already out. Chinese startup MiniMax is following DeepSeek's example with its own release week. What has been shown so far: On Monday, M1, an open-source reasoning model with a one-million-token context window (a record!). It reportedly cost only $500k, and its results approach Gemini 2.5 Pro. Report, GitHub (https://github.com/MiniMax-AI/MiniMax-M1) and weights (https://huggingface.co/collections/MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094). Yesterday, Hailuo 2, a text/image-to-video model with advanced physics and motion. You can try it for free (https://hailuoai.video/create). We are waiting to see what they release today.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
#Monmorning
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
#dailymonad https://madness.finance/exp?ref=WLWBYJ earned 200 points on the leaderboard
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Microsoft introduced a new video generation tool: Bing Video Creator! Users of the Bing mobile app on Android and iPhone can now create short video clips for free using Sora, OpenAI's text-to-video model. The launch underscores Microsoft's push to democratize AI by letting anyone turn text into engaging video. Users can generate up to three videos at a time and choose between standard and fast creation speeds. The release also reflects the growing interest in AI-generated media formats.
0 reply
0 recast
1 reaction

Alex pfp
Alex
@alexdyor
OpenAI has launched o3-pro for ChatGPT Pro subscribers and via the API. It is slightly better than o3 at coding, analytics, science and writing, but not enough of a leap to justify paying $200, so we are waiting for it to roll out to Plus subscribers. Tests show that the model follows instructions better and generates more structured answers. It also hallucinates less: in testing it gave the correct answer to the same question four times in a row. Like o3, o3-pro supports all the tools available in ChatGPT. In addition, OpenAI has significantly reduced the API price of the o3 model. Sam Altman added that the open-weight model will not be released in June as planned. It has been postponed to later this summer because, they say, something unexpected and very powerful awaits us in June. Is it GPT-5?
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
ElevenLabs presented Eleven v3 (alpha), its most expressive text-to-speech model to date. It supports 70+ languages, a multi-voice mode, and now audio tags that set intonation, emotions, and even pauses in speech. The new architecture understands text and context better, producing natural, "live" audio.
What Eleven v3 can do:
• Generate realistic dialogue with multiple voices
• Convey emotional transitions
• React to context and change tone mid-speech
The model is controlled through tags:
- Emotions: [sad], [angry], [happily]
- Delivery: [whispers], [shouts]
- Reactions: [laughs], [sighs], [clears throat]
A public API is promised very soon. This is a preview version, so prompts may need some tuning. But the result is really impressive.
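To make the tag idea concrete, here is a minimal Python sketch that pulls bracketed tags out of a script and flags any that are not in the list from the post. It does not call ElevenLabs at all; the helper names and the sample script are made up for illustration, and the real model may accept many more tags than the eight listed here.

```python
import re

# Tags named in the post; the real model may accept others (assumption).
KNOWN_TAGS = {
    "sad", "angry", "happily",            # emotions
    "whispers", "shouts",                 # delivery
    "laughs", "sighs", "clears throat",   # reactions
}

# Matches anything written as [tag], with no nested brackets.
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


def find_tags(script: str) -> list[str]:
    """Return every bracketed tag in the script, in order of appearance."""
    return TAG_PATTERN.findall(script)


def unknown_tags(script: str) -> list[str]:
    """Return the tags that are not in the list from the post."""
    return [tag for tag in find_tags(script) if tag.lower() not in KNOWN_TAGS]


# Hypothetical sample script using three of the listed tags.
script = (
    "[whispers] I think someone is listening. "
    "[sighs] Never mind. "
    "[laughs] It was the cat."
)

print(find_tags(script))
print(unknown_tags(script))
```

A check like this is handy before sending text to any tag-driven voice model, since a typo in a tag would otherwise just be read aloud or silently ignored.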
0 reply
0 recast
1 reaction

Alex pfp
Alex
@alexdyor
At WWDC 2025, Apple presented a new approach to operating system interfaces called Liquid Glass. This design language changes the look of app interfaces across its devices. The update also includes a new Games app, improved multitasking on iPadOS, and ChatGPT integration in Image Playground for a richer user experience. The release underscores Apple's commitment to AI and opens up new opportunities for developers working with on-device intelligence.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
OpenAI has updated Codex, and now it's even closer to a real AI developer. Here's what's been added:
1️⃣ Codex is now available for ChatGPT Plus. For now the limits are generous, but restrictions may appear during high load to keep the model stable.
2️⃣ The most anticipated update: Codex can now go online while executing tasks to install dependencies, run tests, pull resources, or update packages. Finally.
3️⃣ Internet access is disabled by default. You can enable it when creating or editing an environment, with full control over the allowed domains and HTTP methods.
4️⃣ This feature is available to Plus, Pro, and Team users. Enterprise support is coming soon.
5️⃣ And a bunch of other useful little things: for example, if Codex is working on a task, it now updates the existing Pull Request instead of creating a new one.
0 reply
0 recast
6 reactions

Alex pfp
Alex
@alexdyor
Amazon is taking a step forward in robotics and artificial intelligence by developing software for humanoid robots that will help deliver packages. The company is close to completing an “internal humanoid fleet” to test robots in real-world conditions, working in tandem with Rivian electric vehicles. This innovative approach aims to create versatile robots that can understand and respond to natural language commands, which will significantly improve Amazon's logistics operations.
0 reply
0 recast
4 reactions

Alex pfp
Alex
@alexdyor
Anthropic has announced that upgraded versions of its Sonnet and Opus models will be released in the coming weeks. The new versions of Claude will feature hybrid thinking capabilities and expanded tool use. The test model, codenamed "Neptune" (possibly a reference to version 3.8, since Neptune is the eighth planet in the solar system), is currently undergoing safety testing. In parallel, Anthropic is launching a new bug bounty program to test Claude's safety measures. Anthropic releases updates less often than its competitors (the latest model, 3.7 Sonnet, came out in February), but the company is clearly focused on quality, not speed.
0 reply
0 recast
1 reaction

Alex pfp
Alex
@alexdyor
🚀 Manus has been updated: now it creates professional presentations. The AI agent Manus has received new capabilities for generating presentations. It now builds slides from a single prompt and a few examples. Manus searches the Internet and books, filling presentations with detailed information, images and videos. The user can then edit the result as they see fit. You can try it on the official website (https://manus.im/).
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
The project promises an impressive throughput of 10,000 transactions per second (TPS), combined with a fast 500 ms block time and 1-second finality, while aiming for near-zero or sub-cent gas fees. [1] This positions Monad as a significantly faster and more cost-effective alternative to existing EVM-compatible blockchains. In simple terms: the developers promise a very fast and very cheap chain. The real facts: so far it lives up to its claims))
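As a rough sanity check on the figures quoted above, this short Python sketch works out what they imply per block. The numbers are the project's stated targets, not measurements, and the helper names are made up for illustration.

```python
# Figures quoted in the post (project claims, not measured values).
CLAIMED_TPS = 10_000   # transactions per second
BLOCK_TIME_S = 0.5     # 500 ms block time
FINALITY_S = 1.0       # 1-second finality


def txs_per_block(tps: float, block_time_s: float) -> int:
    """Transactions each block must carry to sustain the given TPS."""
    return round(tps * block_time_s)


def blocks_to_finality(finality_s: float, block_time_s: float) -> int:
    """Number of block intervals that fit into the finality window."""
    return round(finality_s / block_time_s)


print(txs_per_block(CLAIMED_TPS, BLOCK_TIME_S))      # 5000
print(blocks_to_finality(FINALITY_S, BLOCK_TIME_S))  # 2
```

In other words, hitting the claimed throughput means packing about 5,000 transactions into every block, and a transaction would be final roughly two block intervals after it lands.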
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Comprehensive Analysis and Strategic Recommendations for the Monad Testnet. Part 1
I. Executive Summary
Monad Vision and Key Innovations
Monad is positioned as a high-performance, EVM-compatible, Layer 1 blockchain, purpose-built to address the scalability trilemma by delivering high throughput without compromising decentralization or security. [1] This fundamental goal underlies all of its architectural solutions. Its innovative technology stack includes the MonadBFT consensus engine, groundbreaking parallel and deferred transaction execution, and a purpose-built MonadDB database for optimized state storage. [1] These components work in synergy to achieve ambitious performance goals.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Gemini and Monad (there will be many parts)
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
I use it, and I paid for it for a very long time. They were practically the only ones who checked and corrected grammar in text and correspondence. Now I hope they will get even stronger.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Google Unveils Gemini 2.5 Pro Preview. The model is available via the Gemini API, as well as on the Vertex AI and AI Studio platforms, with an update coming soon to the Gemini app. Gemini 2.5 Pro Preview (I/O Edition) has significantly improved code writing and editing capabilities. The model topped the WebDev Arena leaderboard, which evaluates how well AI can build functional websites, and showed strong results in video analysis, scoring 84.8% on the popular VideoMME benchmark.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
🔧 MCP SuperAssistant: an AI agent right in the browser. I discovered an interesting tool that turns a regular AI chat (ChatGPT, Gemini, Perplexity) into an agent that doesn't just answer but performs tasks. Ask it to create a document, fetch data or send an email, and it really does it! No additional API keys or complicated settings.
What it can do:
• Create Google Docs
• Work with Trello and Asana
• Send emails and Slack messages
• Extract data from various sources
For those who want to automate their routine, I advise you to test it! You can try it for free here (https://mcpsuperassistant.ai/)
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Copilot is now testing a new feature, Agent Actions (https://www.testingcatalog.com/microsoft-copilot-starts-testing-agent-actions-and-adds-native-image-generation/): a kind of AI that performs actions in applications instead of you. It is very similar to the Operator concept, a kind of smart agent-bot. It seems that Microsoft is also seriously getting into the AI agent game, and doing so by itself, without even involving OpenAI. The only catch is that, unlike Google, Microsoft does not have its own GPT-class models, so a dependency remains here.
0 reply
0 recast
0 reaction

Alex pfp
Alex
@alexdyor
Grok now creates any PDF document from a single prompt. The neural network from xAI has received a new feature: it now generates documents that really look good and are properly filled in.
Generates:
- Resumes, reports, notes, presentations and any other PDF
- Formatting, fonts, formulas and graphs, all as they should be
- Looks neat, as if a designer did it
- And all this for free!
Now Grok is an editor, designer and assistant in one. You can try it here - https://grok.com/
0 reply
0 recast
1 reaction

Alex pfp
Alex
@alexdyor
Visa unveils an AI-powered system that lets AI agents shop and pay on behalf of consumers, built through partnerships with Anthropic, OpenAI, and others.
🔵 The system uses AI-ready cards with tokenized credentials. AI agents can find and buy products without revealing card details. Consumers can set spending limits and conditions.
🔵 Mastercard's Agent Pay is a similar platform that lets AI agents find and pay for products.
0 reply
0 recast
1 reaction