Overview
What is OpenText Vertica?
The Vertica Analytics Platform supplies enterprise data warehouses with big data analytics capabilities and modernization. Vertica is owned and supported by OpenText.
Good analytical database
Robust Vertica Experience
Vertica Review
Analysis at Scale
Fast with some limitations
Vertica's Strengths and Weakness
Fast and powerful analytics platform
Pricing
What is OpenText Vertica?
The Vertica Analytics Platform supplies enterprise data warehouses with big data analytics capabilities and modernization. Vertica is owned and supported by OpenText.
Entry-level set up fee?
- No setup fee
Offerings
- Free Trial
- Free/Freemium Version
- Premium Consulting/Integration Services
Would you like us to let the vendor know that you want pricing?
49 people also want pricing
Alternatives Pricing
What is ClicData?
ClicData is a 100% cloud-based business intelligence platform that allows users to connect, process, blend, visualize and share data from a single place. As an automated platform, users are able to rely on the latest version of company data, to ensure users make the right decisions. Hundreds of…
What is retailMetrix?
RetailMetrix is a data analytics platform for retailers with the mission of enabling retailers to get value from their data. RetailMatrix processes and stores sales, labor and customer data using data warehouse technologies. Its dashboards and reports allows team to find the data that matters to…
Product Demos
Vertica in-DB Machine Learning Demo
How to recover a HP Vertica Database Node from a Corrupted Catalog
Vertica Optimized for Multiple Clouds Using Attunity Replicate
vertica and elastic search demo
WEBINAR: Predictive Analytics with Vertica
Utilizing Tableau and HP Vertica Demo - Consolidating Worksheets into a Single Dashboard
Product Details
- About
- Tech Details
What is OpenText Vertica?
OpenText Vertica Video
OpenText Vertica Technical Details
Operating Systems | Unspecified |
---|---|
Mobile Application | No |
Comparisons
Compare with
Reviews and Ratings
(29)Community Insights
- Business Problems Solved
- Pros
- Cons
- Recommendations
Vertica has become a crucial tool for businesses looking to analyze large volumes of data for various use cases. Users have found it particularly valuable as a data warehouse for analyzing internal business data and marketing results of clients. Its ability to handle large data sizes enables analysis at a level that would not have been possible otherwise. Uber, for example, has successfully employed Vertica for their data analytics needs. Additionally, companies have created Vertica-based data marts to provide analytics insights and support data science across their entire organizations.
One key advantage of Vertica is its complementary nature with other technologies like Hadoop. By leveraging its high scale capabilities, Vertica enhances data efforts when used alongside Hadoop. The software also serves as the main data warehouse, acting as a source for analytic reports and facilitating data analysis activities. Interestingly, users have discovered non-traditional applications for Vertica, utilizing it as a powerful data processing engine to solve problems at scale. For instance, in the entertainment industry, Vertica is instrumental in rendering data and performing big data analysis tasks efficiently.
The speed of Vertica is highly beneficial to users, allowing them to quickly complete ad-hoc queries and conduct more in-depth analyses. This speed sets Vertica apart from competitors in the highly ingested, fast query analytics niche, including platforms like Teradata, Greenplum, Exadata, and Netezza. Moreover, Vertica excels in handling large amounts of data ingestion quickly, making it a reliable tool for organizations dealing with vast quantities of information.
Furthermore, Vertica serves as an analytics database that can handle real-time streaming data from sources like Apache Kafka. This capability enables organizations to gain near real-time customer insights for their consumer-facing web portals and mobile applications. Overall, users have come to rely on Vertica as an essential analytics database for reporting, ad-hoc queries, and more in-depth analyses across a wide range of industries and use cases.
Impressive Analytical Querying Capabilities: Several reviewers have praised Vertica for its impressive analytical querying capabilities. Users have found the built-in analytical functions to be powerful, allowing them to perform complex analyses across terabytes of data. This feature has enabled users to gain interesting insights and make data-driven decisions.
Efficient Data Ingestion: Many users have highlighted Vertica's efficient data ingestion process as a major advantage. According to reviewers, billions of rows can be easily sent to Vertica via the WOS system, and the data is ready for immediate use. This streamlined data ingestion process not only saves time but also enables quick analysis, enhancing productivity.
Scalability and Performance: The scalability and performance of Vertica have been widely appreciated by reviewers. Users have mentioned that Vertica can scale reasonably well up to 10-20 nodes and handle hundreds of terabytes of data effectively. Additionally, many reviewers consider Vertica as one of the fastest query engines available, with tables containing billions of rows still delivering speedy results for analytical tasks.
Deletion Process: Users have expressed frustration with the deletion process in Vertica, stating that it does not fully delete when prompted and can cause delays in other processes. Some users have reported this issue.
Permissions on Table Manipulation: Reviewers find the permissions on table manipulation lacking in Vertica, as only the owner of the table can edit its structure. This makes it difficult to set up true administrators who can maintain each other's work. Several users have mentioned this limitation.
Handling Petabyte-Scale Data: Vertica struggles to handle petabyte-scale data according to user feedback. It starts to crumble beyond hundreds of terabytes of data. Numerous reviewers have noted this scalability issue.
Users have made several recommendations based on their experience with Vertica. The most common recommendations are:
-
Proper Testing and Preparation: Users suggest that before releasing a major version of Vertica, it is crucial to have thorough testing in place. This ensures that any potential issues or bugs are identified and resolved prior to deployment.
-
Follow Vendor Configuration Instructions: It is advised to closely follow the vendor's configuration instructions when setting up Vertica. This helps ensure optimal performance and stability of the tool.
-
Training and Familiarity: Users recommend sending database administrators (DBAs) for training and studying the SQL limitations of Vertica. It is important to have a good understanding of Vertica and its capabilities to effectively leverage the tool for solving specific business problems.
It is important to note that while Vertica is highly recommended for data warehousing, solving Big Data solutions, and analytical data warehousing, users also suggest considering other database systems if there is not a significant amount of data that needs to be accessed quickly or if a more common/easier-to-set-up system would suffice.
Attribute Ratings
Reviews
(1-7 of 7)Good analytical database
- Column-oriented storage organization, which increases performance of queries.
- Compression, which reduces storage costs and I/O bandwidth. High compression is possible because columns of homogeneous datatypes are stored together and because updates to the main store are batched.
- Shared nothing architecture, which reduces system contention for shared resources and allows gradual degradation of performance in the face of hardware failure.
- Easy to use and maintain through automated data replication, server recovery, query optimization, and storage optimization.
- Support for standard programming interfaces ODBC, JDBC, ADO.NET, and OLEDB.
- Integration to Hadoop with the capability to perform analytics on ORC and Parquet files directly.
- Per TB licensing. Users have to worry about license usage at all times which becomes a challenge with you are working in an organization with huge amounts of data.
- The geospatial functionality could be designed better.
- Support for containerization and flexibility from the deployment standpoint.
Robust Vertica Experience
- After the initial setup and performance tuning phase, Vertica database cluster pretty much runs on its own. We haven't had too much maintenance to do.
- When we had to scale up the cluster from 6 nodes to 12 nodes, it was an easy task.
- At one time, because of some issues with a server, we had to take a node out and could do it on the fly.
- One time, one of the nodes wasn't coming up because of some ambiguity with the local data. Vertica wasn't able to fix it by itself and we were trying to remove the node out of the database and we couldn't do it. It would be great if that could be addressed. Luckily when we rebooted the whole server, some of the dead transaction got flushed because of which vertica was able to recover and the node came up.
Vertica Review
- It is able to intake real-time streaming data without much pre-processing and latency.
- Easy to integrate with real-time streaming ingestion engine.
- Vertica does not perform well when you have a lot of schemata.
- The management console including GUI is lacking features and can be improved with features that are typical of a database.
Analysis at Scale
- Analytical querying due to built in analytical functions that actually perform across TB of data.
- Ingestion of data. We can send billions of rows to Vertica easily via the WOS system and it is ready for use immediately.
- Efficient storage of data. What raw is TB of data, once ingested into Vertica only takes up GB of disk space.
- Management! The management console is intuitive and useful making keeping an eye on your cluster easier than any other product like this I have used.
- Deletion is tough in Vertica. Because one of our larger fact tables is rapidly changing we have a need to run purges on a regular basis. Those purges can take a day and delays the other processes while that is happening. It would be nice if when I hit delete, it really deleted.
- Permissions on table manipulation is a bit lacking. In order to edit a table structure you have to be the owner, ie the creator, of the table. It means setting up true administrators who can maintain each other's work is tough.
Fast with some limitations
- Speed. Even with tables with 20 Billion+ rows, Vertica performs reasonably well.
- Analytical functions. Some of the advanced functions in Vertica enable/facilitate interesting and complex analyses.
- Reliability. We never run into reliability issues with Vertica.
- Pricing: Vertica can get pretty expensive with large data sizes.
- Speed: Queries could always be faster!
- Limited options for querying clients: We primarily use Vertica from our terminals. Options for GUI clients are ugly and outdated. Using the terminal for querying is sometimes annoying, with problems like showing query runtime only in milliseconds and not being able to change it, columns being hard to read when there are more columns than the display space etc.
Vertica's Strengths and Weakness
- Extremely fast query performance - Vertica is one of the fastest query engines out there.
- Scales to TBs - Scales reasonably well up to 10-20 nodes and 10 - 100s of TB of data.
- Easy to Use - Fairly easy to user, we made quite some headway with just 1 person running it for a while.
- PetaByte Scale data - Vertica Just cannot deal with this, it starts to crumble beyond 100s of TB of data.
- Concurrent Usage - Vertica starts to have significant backpressure as your concurrent users grow quickly. We had trouble scaling post 20-30 users and had to invent our our queuing strategies.
- Vertical stack - storage + compute tier in one stack, this doesn't help the cause of scaling. Other systems leverage the advantage of storage and compute being different tiers (eg: HDFS + Presto)
Scaling for PB data and 1000s of DAU is vertica's weak point. The system is just not designed for large scale usage and still has a long way to go to improve scalability. There are experiments to run Vertica query engine on top of HDFS which seem promising, however - if you have the the Hadoop ecosystem you are better off going the HDFS + Presto/Impala/SparkSQL route. But if you are in the Hadoop ecosystem, you probably are already investing a lot in ops.
Fast and powerful analytics platform
- IO optimized - it's a columnar store, no indexing structures to maintain like traditional databases, the indexing is achieved by storing the data sorted on disk, which itself is run transparently as a background process.
- Reduced data storage footprint through advanced encoding schemas (RLE, common-delta, etc.) as well as compression algorithms ability to operate directly on the encoded data.
- Could use some work on better integrating with cloud providers and open source technologies. For AWS you will find an AMI in the marketplace and recently a connector for loading data from S3 directly was created. With last release, integration with Kafka was added that can help.
- Managing large workloads (concurrent queries) is a bit challenging.
- Having a way to provide an estimate on the duration for currently executing queries / etc. can be helpful. Vertica provides some counters for the query execution engine that are helpful but some may find confusing.
- Unloading data over JDBC is very slow. We've had to come up with alternatives based on vsql, etc. Not a very clean, official on how to unload data.
Vertica is not the silver bullet but based on my experience in 9/10 cases in which you need an analytical database, Vertica is probably the answer.
Currently we're using Vertica more as a data processing engine in conjunction with a Hadoop cluster as some of the steps are way more efficient than doing them in Hadoop and easier to manage (e.g. iterative processing steps). We also had a pretty good experience using it with Storm and Hadoop.