Key Takeaways
ChatGPT Plus now allows you to upload images for analysis, enabling identification of plants, animals, and more with multiple photos to narrow down results.
ChatGPT has the ability to analyze intricate diagrams, including flowcharts and slides, by providing explanations about their content and addressing specific queries. It is also capable of efficiently sorting information present in images, such as organizing book or movie titles alphabetically. Consequently, ChatGPT proves to be a valuable tool for enhancing organization and improving accessibility.
Identifying Plant and Animal Life (And Most Things)
: Increased Capabilities for ChatGPT Plus UsersIf you share my passion for capturing images of plants and animals, you'll be delighted to know that ChatGPT can now lend a hand in identifying them, albeit to a certain extent. My personal experiment involved submitting photographs of spiders, and overall, ChatGPT demonstrated the capacity to accurately identify the major spider categories, though not necessarily the exact species.
The unique advantage of ChatGPT in this scenario, unlike Google Lens, is its ability to refine the results based on multiple photos. During my testing, I shared several pictures of the same spider throughout the conversation, and ChatGPT seemed to utilize this supplementary information to approach a more accurate solution.
Understanding Complex Diagrams
If you have received a flowchart or a complex set of PowerPoint slides, ChatGPT can help you understand and interpret them. It is capable of explaining the visual content and addressing any specific queries you may have.
For instance, I have obtained a remarkable flowchart from the University of Alberta that outlines the criteria for determining if something falls under the public domain according to Canadian law. I then utilize ChatGPT to evaluate whether Alice in Wonderland meets the qualifications specified in the flowchart.
It is important to bear in mind that ChatGPT is still susceptible to errors or potentially fabricating information. Hence, it is advisable to verify the responses provided by the model. Alternatively, you can conduct multiple analyses in different chat conversations to ascertain if the outcomes remain consistent.
Sorting Information
I experimented with ChatGPT and discovered two fascinating applications. Initially, I captured an image of my bookshelf and requested ChatGPT to compile a list of all the books in alphabetical order. The outcomes were impressive, although the legibility of the text in the photo depended on your photography skills and the quality of your camera.
Afterwards, I decided to capture an image of our DVD/Blu-Ray collection display and requested ChatGPT to generate a flawless alphabetical listing of all the movie titles. I believe this impeccable outcome can be attributed to the clearer visibility achieved from the photo I took.
You might have numerous ideas for utilizing this functionality, but my initial thought was to use it for locating items in my personal collection. As an instance, I inquired ChatGPT about the whereabouts of my "Dune" copy in the image, and it performed well, except for mistakenly identifying the top of the bookshelf as one of the shelves.
Various Accessibility Options
ChatGPT Plus, with its new voice chat capabilities in the mobile app, has become a powerful accessibility tool when combined with its image input abilities. By simply taking photos and sending them directly to ChatGPT, users can engage in conversations about the images by utilizing the chat mode (activated by tapping the headphone icon). This feature is particularly beneficial for individuals with visual impairments who struggle with complex scenes but are still able to capture photos. Ultimately, this innovation has the potential to revolutionize their experience.
If you utilize images on your website or share them on social media platforms, ChatGPT offers a new feature that allows you to generate comprehensive and descriptive ALT text. ALT text is crucial for visually impaired users as it provides image descriptions for screen readers. Though typically written manually, platforms like Facebook and X (formerly Twitter) enable users to add ALT text to their images. By prioritizing accessibility and visually impaired audiences, ChatGPT enables you to swiftly produce a detailed ALT text description and easily verify its accuracy.
Creating AI Image Prompts From Images
Coming up with prompts for AI image generation tools like MidJourney of DALL-E is more challenging than it may seem. However, advancements in technology, such as DALL-E 3, have improved the comprehension of natural language prompts. This allows the tool to accurately adhere to your specific requests. Despite these enhancements, not everyone is meticulous about providing detailed prompts.
One interesting feature of ChatGPT is its ability to generate a prompt based on an image that you provide. So, if there's a particular AI-generated image or any other image that you admire, you can request ChatGPT to write a prompt reflecting the image's contents. This prompt can serve as a starting point rather than starting with a blank page.
Writing Based on AI Images
We can change our approach and instead of requesting a prompt to produce images, we can rely on ChatGPT to incorporate AI-generated images into the process of creative writing. Here, I have utilized AI to create some captivating fantasy art, and then solicited ChatGPT's assistance in devising a corresponding story concept. This can serve as a catalyst to enhance your own imaginative ideas.The Tip of the Iceberg
These are only a few examples of the easily accessible visual options available in ChatGPT. I anticipate that imaginative users will discover even more ways to enhance productivity and simplify tasks in the upcoming days and weeks. However, we do acknowledge the possibility of new malicious applications emerging, but only time will reveal the extent of that. For the moment, tech enthusiasts have an exciting new tool to experiment with.