Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
I'm looking for a multi server big data sample application, which can be used (a) to experiment with installing and configuring a big data application, and (b) as an example starting point for developing such an application - editing the code, making some changes,etc... In most technologies (e.g. Java EE), such applications are very common, and are very useful as a starting point.
If it can be used for benchmarking, even better.
If it uses one (or more) of Hadoop, Cassandra, HBase, MongoDB, Hive, Redis it would be great.
Thanks!
You can use TeraSort, the benchmarking test packaged with Hadoop. It sorts terabytes of data, and is used to stress test new Hadoop clusters. It's part of the hadoop-x.y.z-examples.jar file that comes with a Hadoop install.
To use it, generate data into HDFS using Teragen, then run Terasort.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about programming within the scope defined in the help center.
Closed 2 years ago.
Improve this question
I am using Win 10 64 bit and have just installed postgresql here: C:\Program Files\PostgreSQL\13. I have an older installation here, with lots of tables: F:\Program Files\PostgreSQL\9.6\data. Both drives are on the same computer. Can my new installation connect to my database on drive f:?
Does the data need to remain in two separate instances? If not, you could export the data from one with pg_dump, and import it into the other. Then decommission the old one.
If you need to maintain separate instances, you could connect them together with postgres_fdw. This is very convenient to query across the two instances, but performance usually suffers, often dramatically.
Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 4 years ago.
Improve this question
Our current Development and Production systems are PHP v5.3.10/Ubuntu 12.04 environment. We Our new environment will be PHP Version 7.0.30/Ubuntu 16.04.
Is there a way to analyze our current PHP5.3 code and determine the scope of how much work would be needed once we port current code into our new PHP7.0 environment?
This will help give us a feel for how much work will be required to migrate from PHP5.3 to PHP7.0.
The performance is the big plus in PHP7, but the effort to migrate to it depends from your time/money and the traffic in the website, if it is heavy you should consider the migration as a real option.
For security reasons, maintenance, lots of important deprecated function, and the new cloud hosting services, I recommend to not use PHP5.3. Migrate to PHP5.6, can be a sooner solution for future problems. In the last days I have been experimenting hosting problems with mysql_connect(This extension was deprecated in PHP 5.5.0) function used in PHP5.3, and this is just an example
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 6 years ago.
Improve this question
I successfully integrate the postgres and eclipse but unable to test the database. I need to know where can i get help if any suggestions would be appreciate.By the way i am using java. Thanks in advance.
Selenium is not the right tool here, as it is a tool for automating testing of web front-ends. You cannot directly test your database using this tool.
You can test an application like phppgadmin, but it isn't clear that is what you want.
A better approach is just to write test cases through jUnit and jdbc, in my view. That gives you better control and testing than you can via a web front-end and it is easier to solve the question of whether you want transactions to rollback or not (selenium poses this problem since it is indirect and tests a web app that interacts with the db over a stateless connection).
Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 9 years ago.
Improve this question
I am building a system that will allow users to upload Word, PDF, and Zip files.
Sometimes, they will review those files and upload newer versions. I will need to keep the older versions too, as they would also like to have the ability to download them.
Instead of storing all versions in full, I would rather store the first versions, and diffs for the successive versions. Basically, I need to store all versions of a document in the smallest possible space. (Maybe it can be done in a different way, without diffs?).
There is no need for merging changes, multi-user, distributed, or any advanced features that typically come with version control systems. All use of the tool will be made through my application.
Is there an easy-to-learn, lightweight, open-source package, with good JVM (or REST) interoperability, that I can use for this purpose?
No needs for reinventing the wheels.
JavaHL or SVNKit will allow you to work with Subversion repositories from Java
Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 9 years ago.
Improve this question
Hey guys
Our application runs on Jboss/Apache Tomcat and the app is build on Java.
I was wondering if theres a free app out there that let us monitor our log files which are in text format whenever a Jboss exceptions shows up in the log???
At the moment we are using a tail (called BareTail) application that highlights the errors for us and someone has to keep an eye on it.
Any help appreciated thanks.
Since you're using JBoss you also want to take a look at monitoring with JBoss Operations Network (JON) or RHQ the community equivalent to JON.
Some other log monitoring tools/solutions are Chainsaw, Graylog2, Scribe, Lilith, Syslog-ng and Clarity.
Check this free opensource monitoring tool Hyperic.
Among many things it does, log monitoring is one thing.
You can install agents on the servers and configure them and this Log server will collect the information and the logs can be viewed using a webbrowser.
Other tools are Splunk, Tivoli Log Monitoring