Navigating the AI Revolution: An In-Depth Analysis of Over 4000 AI Tools & Choosing the Right One

Insights into the Evolving Market and Emerging Trends in AI. Welcome to an exciting exploration of the AI landscape!

Apratim Sahu
16 min readDec 20, 2023

In this article we will go deep into wide range of AI tools to uncover their capabilities and potential applications. It is based on an analysis of a dataset comprising 4,293 AI tools and I’m eager to share fascinating insights that emerged during this process.

Let’s dive in and see what AI has to offer!

Navigating through the Web of AI Innovations (Generated with Midjourney, word cloud made in Python)

Evolution of AI: A Story of Progress & Innovation

Artificial Intelligence has come a long way, transforming significantly in recent years. It has now reshaped our daily lives and the way we conduct work. In this segment, we’ll rewind to the beginnings of AI and trace its evolution to the advanced technology we rely on today. Here is the fascinating journey of AI highlighting key milestones and challenges. Swipe on this interactive carousel to see how AI has evolved.

AI is in a golden age and solving problems that were once in the realm of sci-fi. — Jeff Bezos

AI Tool Data Preparation:

I’ve utilized web scraping with Python libraries Scrapy & Selenium to extract detailed information on 4,293 AI tools listed on Futurepedia as of October 2023. Futurepedia is recognized as the largest AI tool directory.

Here’s an overview of the dataset attributes:

Address: Direct link to each AI tool’s page on Futurepedia.
Tool: Name of the AI tool.
Description: Summary of tool’s functionality averaging 180 words in length.
Upload Date: Date when tool was listed on Futurepedia directory.
Categories: Sectors associated with the tool (can be multiple for a tool)
Plan: Type of access available — free, freemium, paid or with a free trial.
Votes: The number of upvotes from users reflecting the tool’s popularity.

In addition to these attributes, I’ve captured the redirect URLs which link to the actual websites of the tools. Using these URLs, I’ve determined the country associated with each domain.

For this geographical tracking, I leveraged the GeoIP2 Python library, which works with the GeoLite2-Country database. This database, which can be downloaded as a file named ‘GeoLite2-Country.mmdb’, allows the program to determine the country associated with each tool’s domain.

To enhance the analysis, I leveraged the cutting-edge OpenAI Embedding Model to generate a 1,536 dimensional vector from each tool’s description. These vectors capture more than just text, they reveal the semantic meaning or context of the descriptions, providing deeper insights into each tool’s purpose and use.

Rise of AI Tools Across Categories:

This interactive timeline illustrates the shifting landscape of AI tool development and the rise and fall of dominant categories from Nov 2022 to Oct 2023. The visualization shows the addition of new AI tools across different categories each week. Since tools can span multiple categories, the total count is 4914 which is higher than 4293 individual tools listed.

Initially, the list was dominated by tools for copywriting and search engines, which were quickly surpassed by the short-lived ‘fun’ category. However, this category soon faded from the top 10. Notably, productivity tools have demonstrated significant growth, overtaking other categories and emphasizing the evolving focus and advancements in the AI domain.

Monthly Momentum: Tracking Tool Listings Across Top AI Categories

This interactive line chart captures the rise and fall in the number of tools listed across the top 10 categories. Notice the significant peak in activity between April and July 2023 for all the categories. You may select specific categories from dropdown menu to draw comparisons.

AI is probably the most important thing humanity has ever worked on. I think of it as something more profound than electricity or fire. — Sundar Pichai

The Landscape of AI: Dominant Categories in the Field

This bar chart shows top 35 categories, each bar represents the number of tools within a category. With ‘Productivity’ taking the lead followed by ‘Personal Assistant’ & ‘Writing Generators’, it’s clear that efficiency-enhancing tools are in high demand.

Community Favourites: Voting Patterns Across AI Tool Categories

This detailed box plot reveals how user preferences are distributed across the top 10 AI tool categories based on votes received. The median vote count is 21, indicated by a red dotted line. This signifies that half of the AI tools from these categories have received more than 21 votes. The green dotted line represents the 80th percentile of votes, set at 75. This indicates that the top 20% of tools within these leading categories have accumulated at least 75 votes.

‘Education’ category outshines others with the highest median vote count followed by design generators and social media, reflecting the community’s prioritization of learning and development tools. Some categories show a wide range of votes others display a more concentrated pattern, signaling different levels of market maturity and user engagement.

Quadrant Analysis of AI Tool Categories

This scatter plot bifurcates the top categories in 4 quadrants on the basis of number of tools (x-axis) and median votes (y-axis) offering insights into both popularity and presence. We gain valuable insights into their market saturation and user engagement levels.

The categories in green (such as Education) within the top-right quadrant, indicate they are not only abundant in number but also highly valued by users, as evident by their median votes. This reflects a strong demand and user base that actively engages with these tools.

Categories like ‘Search Engine’ and ‘Video Generators’ may have fewer tools in the market but their higher median votes place them in the top-left quadrant pointing to a possibly untapped demand. These categories might represent niche markets where quality and specificity are more important to users than variety.

The categories that fall in the bottom-right quadrant might be oversaturated or emerging fields where user engagement has not yet solidified.

AI Tool Trends: Monthly Growth and Category Dynamics

This stacked bar chart illustrates the monthly growth and fluctuations within the AI tool landscape.

While there’s a noticeable overall dip in the total number of new AI tools in the recent months the productivity tools marked in light green has expanded in %, indicating a growing emphasis on tools designed to boost efficiency and streamline workflows.

Similarly, the startup category is on the rise. This suggests an increasing interest in tools that support new business ventures and innovation.

This chart reflects changing priorities in AI tool development and hints at broader market and industry shifts.

Just as electricity transformed almost everything 100 years ago, today I actually have a hard time thinking of an industry that I don’t think AI will transform in the next several years. — Andrew Ng

Month-Over-Month AI Tool Highlights:

Dive into the monthly star performers in the AI tool space with this insightful chart. Tools released earlier tend to accumulate more votes over time, so I’ve made month-wise comparisons to ensure fairness.

The red dotted vertical line marks the median vote count for each month, providing a benchmark for success, while the green dotted line represents the 90th percentile. This visualization is color coded to distinguish the top 10% of tools in the green zone, the moderately successful ones in yellow, and the tools that are still gaining traction in the red zone. Labels pinpoint some of the most voted tools in their respective months.

Category Champions: Best AI Tools in Each Sector

This interactive radial chart highlights 8 most voted tools within the top 10 categories. The points in inner circle indicate categories, branching out into different plans such as free, freemium and paid. Each bar signifies the number of votes a tool has. Interact with the visualization by clicking on any category or use the dropdown menu.

The development of artificial intelligence will be the most tremendous leap forward for people’s quality of life. — Sam Altman

AI Tool recommendation Widget

This tool provides AI recommendations based on user requirements. Input your specific needs into the text box and the widget will list the top 10 AI tools that best match your criteria along with their similarity scores and URLs for direct access.

The recommendation process involves converting user input into a vector through the OpenAI embedding model. The resulting vector is then compared against a dataset of pre-processed AI tool descriptions, also converted into vector embeddings. Cosine similarity scores are computed to find closest matches and the top 10 tools are displayed.

The backend of this system uses a Flask API to handle requests. For performance efficiency, vector embeddings are stored in a parquet file format within Google Cloud Storage to enable quick retrieval and computation of similarity scores.

Here is a sample output:

The real risk with AI isn’t malice but competence. A super-intelligent AI will be extremely good at accomplishing its goals, and if those goals aren’t aligned with ours, we’re in trouble. — Stephen Hawking

2D Representation of Embeddings: Visualizing AI Tool Similarities

Here is a visual representation of AI tool embeddings. By applying the t-SNE algorithm, I’ve reduced the high-dimensional space of the embeddings from 1536 dimensions down to 2. I have selected top 800 tools on the basis of votes to be shown in a single 2D scatter plot.

The interactive plot created using Bokeh library in Python assigns each AI tool a point in 2D plane. Tools with 700 votes or more are marked in green while those with fewer votes are in red. A higher number of votes results in increased opacity, making the point more prominent. Hovering over these points reveals additional details such as the tool’s category, plan and votes.

The closer the points are in this scatter plot, the more similar the tools are to each other. Neighboring points around any given tool indicate the most closely related options. Zoom into the plot to explore these relationships.

If you wish for a magic genie, that gives you any wish you want, and there’s no limit. You don’t have those three wish limits nonsense, it’s both good and bad. One of the challenges in the future will be how do we find meaning in life. — Elon Musk

Geographical Distribution of AI Tools

Following analysis shows dominance of certain countries in the field. Top 8 countries including those labeled as ‘undefined’, account for 95.5% of all AI tools listed. United States leads the chart with India commendably taking second place.

Following stacked bar chart visualizes monthly contribution of each country to the total count of AI tools uploaded.

Despite fluctuations in numbers, the proportion of tools from each country remains relatively consistent over time. This indicates a stable geographical landscape for AI tool development.

Plan Analysis: Distribution and User Engagement

The majority of tools fall into the Free category (34%) followed by Freemium (26%), Paid (18%), Free Trial (15%) and Contact for Pricing (7%). Only a negligible fraction of tools (0.5%) offer multiple plan options which I’ve excluded for a clearer analysis.

We observe a gradual increase in the percentage of ‘Paid’ tools, suggesting a market trend towards monetization and perhaps an increase in premium offerings where users are willing to invest in higher-quality paid solutions.

The ‘Freemium’ category shows a decreasing trend in its share of the total AI tools. This could indicate a transition away from the freemium model or a saturation of offerings that initially aimed to attract users with free services before converting them into paying customers.

Below boxplot sheds light on the user engagement across different plans as measured by votes. Each point represents an AI tool within the respective plan category.

‘Free’ plan tools not only have the highest number of offerings but also attract more votes with a median vote count of 47, which stands out against the other categories. This is significantly higher than the overall median of 21 votes marked by red dotted line. The green dotted line marks the 80th percentile of votes at 75. Over 75% of tools in the ‘Paid’ category receive fewer votes than the overall median, indicating that users are less inclined to vote for paid tools.

It’s not going to be about man versus machine, it’s going to be about man with machines. — Satya Nadella

Quarterly Breakdown of AI Tools by Plan and User Engagement

The data provides a clear quarterly overview of AI tool performance across different plans for the year 2023, showing the changes in tool count and votes relative to the previous quarter.

All plans experienced a peak in tool count during the second quarter. The third quarter saw a significant downturn particularly for Free and Freemium plans, which suffered the most notable declines.

Considering the expected accumulation of votes over time, a drop in votes from quarter to quarter is apparent. Free Trial plan recorded a significant reduction in median votes in Q2, while Paid plans encountered most substantial decrease in Q3, highlighting changing trends in user engagement and tool popularity.

Free tools consistently outperformed other plans in voting across all percentiles suggesting that Free tools are successful in maintaining user interest and positive reception over time.

Intersection of Categories and Plans

The stacked chart reveals how different plans are distributed within the top categories of AI tools.

Categories such as Image Generator, Customer Support, Image Editing and SEO have a larger proportion of Paid tools, indicating a trend towards monetization in these services. Categories like Fun predominantly feature Free tools, aligning with the expectation that entertainment or casual use tools would be more accessible to users.

The treemap visualization shows the top 7 categories in each plan (excluding ‘Contact for Pricing’). The size of each block represents count of tools and color indicates median number of votes which is also labeled for clarity.

‘Free’ tools in Education, Startup, Writing Generators, and Image Generators categories have the highest median votes implying that they have successfully engaged a substantial user base.

Unveiling Common Themes through Word Frequency Analysis

In this word frequency analysis, I’ve utilized natural language processing to extract major themes, features or capabilities from the tool descriptions. The analysis identifies the most recurrent words, bi-grams (pair of consecutive words) and tri-grams (group of 3 consecutive words) within each tool category. Analysis excluded stopwords (commonly used word such as- the, a, in) which carry little useful information.

For example, ‘Productivity’ category frequently mentions ‘ai assistant’ and ‘save time,’ emphasizing efficiency and AI’s role in boosting productivity.

Here are the most common words, bi-grams and tri-grams for the productivity category. Number in parentheses indicates the count of occurrences for the respective term across all productivity tool descriptions.

Insights are illustrated in the interactive sunburst chart where each segment corresponds to a category and showcases its top five bi-grams and tri-grams. Simply select a category from the dropdown menu or click directly on a segment to navigate the chart.

Key themes identified in each category include:

AI is like nuclear energy, both promising and dangerous. — Bill Gates

Advanced AI Tool Recommendations: Harnessing TFIDF Vectorization, TSNE Clustering and Bokeh Visualization

I’ve created a personalized tool recommendations model based on the similarity of tool vectors considering the tool’s description, categories, plan and votes. These elements are weighted at 3, 0.5, 0.2 and 0.2 respectively to reflect their significance in similarity score calculation.

Tool descriptions undergo TFIDF vectorization which assigns weight to words, bi-grams and tri-grams present in a tool description based on their uniqueness and relevance across the dataset. The TFIDF settings used are: TfidfVectorizer(ngram_range=(1, 3), max_df=0.7, min_df=2, stop_words=’english’).

For example, here are the top 10 TFIDF features for ChatGPT, each with its corresponding weight, showcasing the tool’s focus areas:

virtual agents (0.624), understands (0.406), chatgpt (0.403), agents (0.401), customer (0.374), conversational (0.333), virtual (0.326), seeking automate responses (0.323), input provide (0.323), leverages openai technology (0.323)

After one-hot encoding categorical variables (categories and plans) and scaling votes, the final tool vector dimension is 92,885 for each tool. A similarity matrix is then computed to determine the similarity between tools.

For instance, top 12 most similar tools to ChatGPT are Kore.ai, Cohere, Agent4, TeleWizard, Typemagic, echowin, Corpora, AiCogni, Ebi.Ai, Giti, BulkGPT, Visus.ai This reveals a set of tools with a shared focus on virtual agents and conversational AI, leveraging AI technology.

To present these relationships visually, I’ve used t-SNE for dimensionality reduction, condensing the high-dimensional vectors down to a 2D space, which is then illustrated through an interactive Bokeh plot. Kmeans clustering organizes the tools into 24 distinct groups, each marked by a different color in the visualization. Closer points indicate higher similarity and the plot allows for zooming and hovering to reveal further details about each tool.

This combination of TFIDF vectorization, clustering and interactive visualization creates an advanced yet user-friendly platform for navigating the AI tool ecosystem, highlighting how tools relate to one another in terms of functionality and focus.

URL Analysis: Hosting Platforms and User Engagement

Majority of tools have their own dedicated websites or are hosted on various platforms. By examining the final redirect URLs of these tools I’ve identified the most common platforms, which could influence how users discover and interact with these tools.

Here are the top 15 hosting platforms for AI tools along with the respective count of tools: apps.apple.com (55), chrome.google.com (46), appsumo.com (24), github.com (21), play.google.com (18), bubble.io (7), huggingface.co (6), messengerx.io (6), founderpal.ai (3), ibm.com (3), preppally.com (3), cloud.google.com (3), adaptify.ai (3), buildai.space (3), openai.com (3)

The table details 25th, 50th (median), and 75th percentile of votes for tools published on the top 5 hosting platforms.

While github.com is noted for hosting high-quality tools, as indicated by their median user votes, it’s interesting to see that apps.apple.com, despite being the top hosting platform by count, has received the least number of user votes. This discrepancy points to the difference in user engagement across platforms.

AI Tools’ Success Prediction Using Machine Learning

This analysis is aimed to predict the success of new AI tools. The key to this analysis was to calculate a percentile score for each tool based on votes received, compared with other tools in the same upload month. This time-sensitive approach provides a fair comparison by accounting for the changing landscape of popularity and competition.

Utilizing this data, a Support Vector Machine (SVM) model was trained with tool descriptions, plans, and categories serving as input variables. Features were engineered through TF-IDF vectorization of descriptions (weight=1), one-hot encoding of categorical variables (weight=0.25), and a combined feature vector.

ML Model — Support Vector Machine (SVM):

The model was built using Support Vector Regressor (SVR) from scikit-learn, with hyperparameters optimized through grid search to minimize negative mean squared error (MSE). A 3-fold cross-validation was employed, harnessing multiple CPU cores to enhance efficiency. The dataset was divided into an 80/20 split for training and testing, maintaining consistency with a set random seed.

Model Evaluation:

The SVM model was evaluated using Mean Absolute Error (MAE) and Median Absolute Error (MedAE), with scores of 16.9 and 12.5 respectively. These metrics indicate that mean model predictions are generally within 16.9 and median within 12.5 percentile points of the actual values, which is considered moderate accuracy given the range of the percentile scores (0–100).

A higher predicted percentile indicates a higher likelihood of success.I evaluated the effectiveness of SVM model by comparing it with two naive baseline predictors:

A Mean Naive Predictor always predicts the average success rate of all tools in the training set, while a Median Naive Predictor always predicts the middle success rate, regardless of the tool’s specific details.

The comparison revealed that the SVM model’s predictions are substantially more accurate, with both the Mean Absolute Error (MAE) and Median Absolute Error (MedAE) scores of Naive predictor models averaging around 25. It suggests that SVM model is capturing more complex, meaningful patterns in the data rather than merely guessing based on central tendencies.

Comparison with Linear Regression:

I also compared the SVM model’s performance with a Linear Regression model. SVM model slightly outperformed Linear Regression model in terms of MAE (17.8) and MedAE (14.4) suggesting that it captures the data’s patterns more effectively.

Predicting New Tool Success:

For a new tool, the SVM model can estimate the success percentile after preprocessing the tool’s data. As a hypothetical example, a ‘productivity’ tool with a ‘Free’ plan with given ‘Description’ (The tool creates a schedule of gym workouts on the basis of user diet and medical history) have a predicted success percentile of 68, indicating a high likelihood of success.

The provided code outlines the entire process:

This analysis demonstrates the effectiveness of using machine learning, specifically SVM model, to predict the success of AI tools based on various features. Model outperforms baseline models and provides valuable insights into the potential success of new tools in the AI ecosystem.

Conclusion

The amazing story of AI shows how far we’ve come from its simple beginnings to now being a big part of our daily lives. It shows a world of technology that’s always changing and growing.

In the analysis, we found that free tools usually get most attention and votes. Also the AI landscape can change a lot from one quarter to the next. As AI keeps evolving, we have to make sure we use it in a way that’s fair and safe for everyone.

Thanks for taking the time to read! To stay updated on my future articles please consider following me.

You can connect with me via LinkedIn: https://www.linkedin.com/in/apratim24

Check my previous article:

--

--

Apratim Sahu

Growth & Analytics 📈 B.Tech M.Tech IIT Kharagpur 🎓 Passionate about Computer Vision and AI 💻 LinkedIn: linkedin.com/in/apratim24