Author: Olanrewaju Adeniyi

  • Google’s Gemini 2.0 Flash: AI tool that mimics playful voice, creates and edits images, texts 

    Google’s Gemini 2.0 Flash: AI tool that mimics playful voice, creates and edits images, texts 

    Google announced its next-generation AI model, Gemini 2.0 Flash, on December 11, 2024. This model extends its capabilities beyond text generation to include native creation of voice, graphics, and text.

    This flagship AI tool positions Google to compete directly with OpenAI’s advanced offerings and introduces enhanced multimodal features.

    Read also: OpenAI considers monetising ChatGPT, other AI products with ads

    Gemini 2.0 Flash has enhanced features and multimodal capabilities

    Gemini 2.0 Flash is designed to seamlessly generate and edit audio, images, and text. It can process multimedia inputs such as videos and audio recordings to answer contextual queries like “What did he say?” The audio generation feature allows customisation of speech, supporting eight optimised voices in various languages and dialects. Users can modify delivery styles, such as asking for slower speech or playful pirate-like tones.

    Developers accessibility to Google’s Gemini 2.0 Flash

    Today, developers can experiment with 2.0 Flash on platforms like Vertex AI, AI Studio, and the Gemini API. While audio and image generation features are restricted to early access partners, the production version of Gemini 2.0 Flash will become widely available in January 2025. 

    Additionally, Google is introducing the Multimodal Live API, enabling real-time audio and video streaming capability integrations into apps, similar to OpenAI’s Realtime API. 

    Read also: Google Gemini’s new memory feature remembers all your favourite restaurants, important dates

    Gemini 2.0 Flash offers improved speed, accuracy, and functionality

    Google claims that Gemini 2.0 Flash outpaces its predecessor, Gemini 1.5 Pro, in coding, image analysis, and factual accuracy benchmarks. It’s faster and more adaptable, and features enhanced arithmetic abilities, making it Google’s most robust Gemini model. The model is also integrated with SynthID watermarking technology to mark synthetic outputs, addressing concerns over deepfakes.

    Gemini 2.0 Flash represents a leap forward in AI innovation, combining real-time, multimodal capabilities with user-friendly features. By making powerful APIs and tools accessible to developers, Google aims to l in the race for practical, ethical, and advanced AI applications.

  • ChatGPT and Sora suffer major outage amid high demand

    ChatGPT and Sora suffer major outage amid high demand

     

    Olanrewaju Adeniyi

     

    OpenAI suffered a major outage around 3:00 p.m. PT on Wednesday, leaving users of ChatGPT, Sora, and its developer-facing API digitally stranded.

    The AI company confirmed the outage on its status page but issued another update around 9:00 p.m. PT, claiming its services have been largely restored.

    In a tweet, OpenAI stated, “ChatGPT, API, and Sora were down today, but we’ve recovered.”

    The cause of the outage was not yet known. Around 7:00 p.m. PT, OpenAI announced on their status page that ChatGPT, the API, and Sora were gradually returning to the Internet.

    ChatGPT error messages and impact

    During the downtime, ChatGPT.com displayed an error message that said, “ChatGPT is currently unavailable. We’ve located the problem and are attempting to implement a solution.”

    On the same day that OpenAI’s interface with Apple was introduced in iOS 18.2, ChatGPT went down. Due to Wednesday’s outage, some users complained on social media that ChatGPT was not functioning in Apple Intelligence. 

    In a post on X, Edwin Arbus, the leader of the OpenAI developer community, stated that the downtime had nothing to do with Apple Intelligence or 12 Days of OpenAI. “We changed the configuration, which made many servers unavailable.” 

    Sora’s launch challenges

    OpenAI also made Sora available to the public earlier this week. According to CEO Sam Altman, the business had to restrict the number of individuals who could join up because it didn’t foresee the volume of interest it would attract. On launch day, a large number of registered users were unable to create films because OpenAI’s servers were full.

    The disruption comes after Meta’s products had another worldwide service failure early on Wednesday.

    This product outage occurred on the fifth day of OpenAI’s “12 days of OpenAI” event, during which the firm has been shipping new products every day in the run-up to the holidays. OpenAI has announced the complete release of its o1 reasoning model, a research program focused on fine-tuning reinforcement, the release of Sora, certain Canvas upgrades, and the interface with Apple Intelligence. 

    Edited

     

  • YouTube expands AI auto-dubbing feature to knowledge-sharing creators

    YouTube expands AI auto-dubbing feature to knowledge-sharing creators

    YouTube officially rolled out its AI-powered auto-dubbing feature to a broader audience on December 10, 2024. Initially tested with select creators after its introduction at Vidcon last year, this tool is now available to hundreds of thousands of channels focusing on knowledge-based content, such as cooking tutorials and DIY projects.

    Read also: YouTube on your TV? Airtel Nigeria teams up with Google to convert analogue TVs into smart screens

    How YouTube enhances accessibility with auto-dubbing

    The feature is designed to make content accessible to global audiences by automatically generating audio tracks in multiple languages. Upon uploading a video, creators can benefit from automatic language detection and translation. Supported languages currently include English, French, German, Hindi, Indonesian, Italian, Japanese, Portuguese, and Spanish, with more to be added in the future.

    The technology behind YouTube’s new tool

    YouTube’s auto-dubbing leverages Google’s Gemini AI technology, known for its ability to mimic human speech. While this innovation bridges language barriers, the feature remains in its infant stages, with occasional translation or dubbed voice representation inaccuracies. YouTube emphasises that creator feedback is vital for improving the tool’s performance.

    Read also: YouTube bolsters deepfake detection tools

    In addition to refining the auto-dubbing tool, YouTube plans to introduce an “Expressive Speech” update. This enhancement aims to replicate the creator’s tone, emotions, and even the ambience of their surroundings, offering a more immersive experience for viewers.

    By expanding this feature, YouTube reinforces its commitment to making its content library inclusive and accessible. While it currently caters to knowledge-focused creators, the company has plans to extend the tool’s availability across other content categories, further cementing its global reach.

  • Instagram introduces “trial reels” for creative experimentation

    Instagram introduces “trial reels” for creative experimentation

    Instagram launched “trial reels” on November 10th, 2024, a feature designed to allow creators to experiment with content without displaying it to their followers.

    This innovative tool, first tested in May, is now available globally for professional account holders. The aim is to provide a platform for creators to refine their ideas while reducing the pressure of audience judgement.

    Read also: Meta enhances Threads with customisable default feeds, to compete with Bluesky

    How Instagram’s “trial reels” work

    “Trial reels” are shown exclusively to non-followers, ensuring creators can test their content without disrupting their established audience. By toggling the “Trial” option during reel creation, creators can share experimental videos that do not appear on their profile’s primary grid or reels tab.

    Performance metrics such as views, likes, and comments become available after 24 hours. Based on these results, creators can decide whether to archive or publish the reel for their followers.

    Instagram’s “trial reels” supports bold creativity

    According to Instagram’s Vice President, Ashley Alexander, this feature addresses creators’ concerns about experimenting with new genres or topics. Many creators hesitate to post experimental content because they fear alienating their established follower base. Trial reels eliminate this barrier, offering a safe space for creators to explore bold ideas without the risk of backlash.

    Alexander noted that this feature empowers creators to innovate and take creative risks, fostering a more dynamic content landscape. For instance, a fashion influencer can test singing videos without disrupting their audience’s expectations.

    Read also: Instagram rivals Apple and Snapchat with live location sharing in DMs

    Competitive edge for Instagram

    Trial Reels position Instagram ahead of competitors like TikTok, which currently lacks a similar feature for experimentation.

    Instagram aims to help creators improve their strategies and expand their reach by providing tools to evaluate content performance. This update is part of Instagram’s broader initiative to remain a leader in the creator economy.

  • Google’s GenCast could transform weather prediction, outperform European forecast’ service

    Google’s GenCast could transform weather prediction, outperform European forecast’ service

    This week, Google’s DeepMind team unveiled GenCast, an AI-powered weather prediction model that sets a new benchmark in forecasting accuracy.

    DeepMind researchers claimed in a Nature paper that GenCast outperforms the European Centre for Medium-Range Weather Forecasts’ ENS, widely regarded as the world’s leading operational forecasting system.

    Read also: Google hires new country director for South Africa

    GenCast’s revolutionary approach

    Unlike traditional deterministic models, which provide a single forecast, GenCast generates an ensemble of over 50 predictions. Each prediction represents a potential weather trajectory, forming a detailed probability distribution of future weather scenarios. This probabilistic approach allows for more nuanced insights into weather patterns and risks.

    DeepMind tested GenCast’s performance by training it on weather data up to 2018 and then comparing its forecasts for 2019 against those from ENS. Results showed that GenCast was more accurate 97.2 percent of the time, establishing its superiority in forecasting precision.

    Integration with Google Products

    GenCast is part of Google’s expanding suite of AI-driven weather models. The company plans to integrate GenCast into Google Search and Maps, enhancing users’ real-time weather information. Additionally, Google aims to release GenCast’s real-time and historical forecasts, enabling researchers and organisations to leverage its data for their projects.

    Read also: Google Photos’ new feature lets users delete Cloud storage without losing files on device

    Implications for Gencast’s weather prediction

    The launch of GenCast represents a paradigm shift in meteorology. Its probabilistic framework provides a more comprehensive understanding of weather outcomes, benefiting agriculture, disaster preparedness, and transportation sectors. By making GenCast’s forecasts publicly accessible, Google underscores its commitment to democratising cutting-edge AI for global impact.

    This advancement highlights AI’s transformative potential in addressing complex challenges, with GenCast leading the way in redefining how we predict and prepare for weather events.

  • Apple’s Vision Pro targets gamers with PlayStation VR2 support

    Apple’s Vision Pro targets gamers with PlayStation VR2 support

    Apple is looking to expand the audience for its Vision Pro mixed reality device by targeting gamers and game developers. Initially marketed as a productivity and media consumption tool, the Vision Pro’s reliance on eye and hand controls has limited its appeal among gamers who typically prefer precise controller-based inputs.

    With sales reported to be under 500,000 units, the company is exploring ways to make the device more attractive to this key demographic.

    Read also: HTC unveils premium mixed-reality headset to rival Quest 3 Pro

    Apple’s collaborations with Sony and other game developers

    To bridge the gap, Apple is reportedly in talks with Sony to enable support for the PlayStation VR2’s hand controllers. This move would allow for greater precision in gaming and open doors for developers to integrate these controllers into Vision Pro-compatible games. By adding hardware support for widely accepted controllers, Apple aims to make the Vision Pro a more versatile and appealing device for gamers.

    These enhancements may not only benefit gaming but also improve the functionality of professional software like Final Cut Pro and Adobe Photoshop. The precise input of external controllers could allow for more seamless interactions, making the Vision Pro a compelling choice for creative professionals seeking a mixed reality platform.

    Read also: Apple faces lawsuit over child abuse images on iCloud

    Expanding Apple Vision Pro’s potential

    Apple’s pivot to gamers reflects a strategic response to the competitive landscape of mixed reality technology, where devices like Sony’s PlayStation VR2 and Meta’s Quest series dominate gaming-focused segments. By addressing its input limitations and fostering collaborations, Apple hopes to attract a broader audience while establishing Vision Pro as a platform for both entertainment and productivity.

    This shift indicates that Apple recognizes the importance of balancing its core strengths in productivity and media consumption with the immense potential of the gaming market to drive adoption of the Vision Pro.

  • From classroom to home: Why Raspberry Pi 500 is perfect entry-Level computer for all ages

    From classroom to home: Why Raspberry Pi 500 is perfect entry-Level computer for all ages

    The Raspberry Pi Foundation has unveiled its latest innovation, the Raspberry Pi 500, which redefines simplicity and functionality in single-board computers.

    Serving as the successor to the Raspberry Pi 400, the Pi 500 packs the processing power of the flagship Raspberry Pi 5 while maintaining a user-friendly keyboard-integrated design. This updated model combines accessibility with advanced capabilities, making it ideal for newcomers and seasoned users.

    Read also: Raspberry Pi Pico 2 W: A budget-friendly, high-performance microcontroller for hardware projects

    Raspberry Pi 500 has a simplified design for all

    Designed with usability in mind, the Raspberry Pi 500 eliminates the intimidating appearance often associated with single-board computers. Instead of exposing chipsets and circuit boards, the components are housed within a sleek keyboard. This design ensures that users only need to connect a mouse and display to start using the device, making it a practical option for replacing outdated PCs or introducing computing to beginners.

    Enhanced specifications and functionality of Raspberry Pi 500

    The Raspberry Pi 500 has a 64-bit quad-core Arm processor, the same as the Raspberry Pi 5, and 8GB of RAM. It supports up to two 4K displays via its two micro-HDMI ports and features three traditional USB ports, a Gigabit Ethernet port, and a 40-pin expansion header. Native Wi-Fi and Bluetooth capabilities further enhance connectivity. While it lacks USB-C ports for general use (apart from the power port), the device is highly versatile.

    Storage is managed through a preloaded 32GB SD card running Raspberry Pi OS, a Debian-based Linux distribution. At $90, the Pi 500 offers affordability, with a $120 desktop kit that includes essential peripherals such as a mouse, a power supply, and an HDMI cable.

    The Raspberry Pi 500 reflects the Foundation’s original mission of promoting education and innovation. Its affordability and customisable nature make it an excellent tool for schools, offering students an engaging platform to explore computing. Unlike Chromebooks or iPads, it encourages hands-on creativity, fostering critical thinking and problem-solving skills.

    At launch, the Raspberry Pi 500 supports U.K. and U.S. keyboard layouts. However, additional variants for French, German, Italian, Japanese, Nordic, and Spanish markets are expected soon. This global approach ensures that the Pi 500 can cater to a broad audience, solidifying its position as an inclusive computing solution.

    Read also: Acer Nitro V with RTX 4060 review: A budget gaming Laptop that punches above its weight

    A complementary offering: Raspberry Pi monitor

    In addition to the Pi 500, the Foundation has introduced the Raspberry Pi Monitor, a 15.6-inch, 1080p display priced at $100. While not groundbreaking, this branded monitor provides a cohesive option for Raspberry Pi enthusiasts seeking a unified setup.

    The Raspberry Pi 500 exemplifies the Foundation’s commitment to making computing accessible, educational, and innovative. With its blend of power, affordability, and ease of use, it’s poised to be a game-changer in classrooms, homes, and beyond.

  • Meta’s Threads rolls out advanced search features to rival X and Bluesky

    Meta’s Threads rolls out advanced search features to rival X and Bluesky

    Meta is significantly enhancing its microblogging platform, Threads, with new search functionalities, signalling its intent to compete more aggressively with rivals like Bluesky and X (formerly Twitter).

    Announced on December 2, 2024, Threads now allows users to refine searches by user profiles and date ranges, improving its usability and aligning it more closely with modern platform standards.

    Read also: TikTok deletes 12 million videos across Africa, enforces stricter age rules

    Threads closes the search gap with X and Bluesky

    Before this update, Threads’ search capabilities were pretty basic, offering only two filters: Top posts for those with high engagement and Recent posts for the latest updates. While the new additions are a step forward, they remain less comprehensive than X’s advanced search, which supports filtering by language, keywords, hashtags, and more. 

    However, Threads’ latest changes make it more competitive with Bluesky, which offers filtering options like date ranges and user profiles but hasn’t fully integrated these features into its interface.

    How Meta’s Threads is addressing rising competition

    Threads has been under pressure to innovate as competitors like Bluesky gain traction. With nearly 24 million users, Bluesky has seen rapid adoption due to user dissatisfaction with X and its distinctive platform features. To keep up, Threads has introduced several enhancements. It now allows users to customise their feeds, offering more control over the content they see. 

    Another notable addition is the introduction of “Starter Packs,” curated recommendation lists aimed at helping users discover new accounts and communities. 

    Furthermore, Threads has rebalanced its algorithm, ensuring users see more content from the accounts they follow rather than random suggestions. These measures demonstrate Meta’s strategic intent to retain its user base while appealing to potential newcomers.

    Meta says the new search filters will be available globally in the coming weeks. These updates are part of Threads’ broader strategy to attract and retain users by addressing long-standing feedback. In addition to search enhancements, the platform has introduced trending topics with AI-driven summaries and a redesigned interface that streamlines navigation.

    Read also: Google, Microsoft, X, TikTok remove 65 million content posts, deactivate 12 million accounts in Nigeria: NITDA

    The bigger picture

    Threads’ improvements reflect Meta’s commitment to positioning the platform as a viable alternative in the crowded social media landscape. With over 275 million active users, Threads remains a strong contender, bridging the gap between the traditional Twitter experience and newer platforms like Bluesky. While it still has room to grow in advanced features, these updates demonstrate Meta’s focus on meeting user expectations and staying competitive.

  • From tasks to habits: Top Apple Watch apps for staying organised

    From tasks to habits: Top Apple Watch apps for staying organised

    The Apple Watch is often celebrated for its health and fitness features, but it can also be a powerful productivity tool, especially for those looking to minimise distractions from their phones. With built-in apps like Reminders and Calendar, the device offers basic functionality, but third-party apps can take your productivity to the next level.

    Read also: What to know about Tecno’s affordable Spark 30 Pro smartphone with 120Hz refresh rate 

    Todoist

    Todoist is a highly regarded task manager that allows you to create, organise, and complete tasks directly from your wrist. With customisable watch face integrations, you can use voice commands or the watchOS keyboard to add tasks and view your progress. The app is free, but premium features, including custom task reminders and calendar views, are available for four dollars per month.

    Drafts

    Drafts fill the gap left by the absence of a native Notes app for quick note-taking. It provides a blank canvas for capturing ideas using voice dictation, Scribble, or a keyboard. Notes can be flagged, organised into folders, and synced across devices. The app is free, but premium features like themes are available for $1.99 per month.

    Focus

    Focus is ideal for managing work sessions through structured “Focus Sessions.” This app encourages you to work in concentrated blocks, followed by breaks, to enhance productivity. It reduces distractions by keeping everything accessible on your wrist. Subscriptions start at $7.99 per month.

    AutoSleep

    While primarily a sleep tracker, AutoSleep supports productivity by offering insights into your sleep quality and readiness for the day ahead. Its readiness score and personalised sleep recommendations can help you optimise your daily routine. The app is available for a one-time fee of $5.99.

    Streaks

    Streaks are perfect for habit formation, allowing you to track up to 24 habits. Whether you aim to learn a new language, exercise, or adopt a healthier diet, this app visually tracks your progress and helps you maintain momentum. It costs $5.99 as a one-time purchase.

    Read also: Unpacking Apple’s upcoming Mac releases and new AI features

    Fantastical

    Fantastical is a versatile calendar app that provides an overview of your schedule, including weather updates and reminders. It offers multiple viewing options, such as a list of upcoming events or a compilation of tasks. The app integrates seamlessly with the Apple Watch and costs $4.99 monthly for premium features.

    These apps showcase the Apple Watch’s potential as a productivity powerhouse, transforming how you manage tasks, notes, time, and habits. Explore these tools to make the most of your Apple Watch.

  • Unveiling the OPPO Reno12 F 5G: High performance, design, and AI-powered photography

    Unveiling the OPPO Reno12 F 5G: High performance, design, and AI-powered photography

    The OPPO Reno12 F 5G has solidified its position in the competitive midrange smartphone market, combining performance, design, and innovative features to appeal to a diverse audience.

    Design and display of the OPPO Reno12 F 5G

    The Reno12 F 5G boasts a sleek design highlighted by its unique “Breathing Light” feature surrounding the camera module. This light not only enhances aesthetics but also serves practical purposes such as alerting users to notifications, calls, and charging status. It supports customisable colors and patterns, adding a personalised touch.

    Read also: Black Friday: DualSense controllers on sale for $55

    The phone features a 6.7-inch AMOLED display with FHD+ resolution and a smooth 120Hz refresh rate. This ensures vibrant visuals and fluid animations, making it ideal for multimedia consumption and gaming.

    OPPO Reno12 F 5G’s performance and software

    Powered by the MediaTek Dimensity 6020 chipset, the Reno12 F handles multitasking and moderate gaming with ease. It runs on ColorOS 14.0.1 based on Android 14, offering a clean, user-friendly interface with features like a Dynamic Island-inspired notification widget.

    The device comes with 8GB of RAM, 256GB of internal storage, and the option for expandable virtual RAM, catering to users who demand efficiency and storage flexibility.

    Camera capabilities

    Photography is a standout feature of the Reno12 F 5G. The rear camera setup includes a 50MP primary sensor, an 8MP ultra-wide lens, and a 2MP macro sensor. For selfie enthusiasts, the 32MP front camera delivers excellent results, complete with AI enhancements.

    Key features like AI Eraser, AI Smart Image Matting 2.0, and Pro Portrait Mode elevate the photography experience, enabling users to capture stunning images and perform creative edits effortlessly. The phone also supports Dual-View Video recording, combining footage from both front and rear cameras in real time.

    Read also: New Age reinvents charging with Its 80,000mAh Heavy-Duty Power Bank

    Battery and connectivity

    The device is equipped with a 5000mAh battery that supports 45W SuperVOOC fast charging. While not the fastest in its class, it ensures quick top-ups to keep you going. AI LinkBoost and 360-degree Surround Antenna enhance connectivity, making the phone reliable for daily use, even in areas with weak signals.

    The OPPO Reno12 F 5G is a versatile option for users seeking a stylish smartphone with advanced AI features, solid performance, and reliable battery life. Though its gaming capabilities and charging speeds may not rival higher-end models, it strikes an excellent balance between price and functionality, making it a worthy contender in the midrange segment.