Wordpress parser. Free content parser - AftParser

Wordpress parser.  Free content parser - AftParser
Wordpress parser. Free content parser - AftParser

A fairly powerful universal parser for WordPress. Allows you to collect content from one or more sources and process it, adjusting it to the required format using all the features of the PHP language. There is a possibility of delayed parsing. The best free parser for wordpress at the moment - AftParser is always at your service!

Brief description of the functionality:

The parser consists of 4 pages: Main Page, Link Parser Page, RSS Feed Parser Page and Settings Page. Here's what it will look like after installation:

Attention: You can read how to set up AftParser.

Let's start with home page. It displays a list of parsers currently running.

Explanations are given in blocks on each page. All documentation is delivered out of the box, it is enough to read carefully to make everything clear.

Site Parser:

The site parser page allows you to parse data from both one and several sources. It is only necessary to provide links to materials.

What? Too lazy to dig through the network and manually collect materials? Don't despair - everything is automated.

There are two tools that allow you to fill in the list of links automatically.

- kind of emulation search engine. The robot will walk through the pages of the site transferred to it and collect all internal links from them.

Naturally, a list of links filled with automatic algorithms will be heavily littered with unnecessary data. This is where filters come to the rescue.

- the simplest and fastest way to filter. You enter the conditions and the filter itself performs the processing.

Advanced link filter- a link filter that allows you to change their content and do a bunch of other things. For experienced users only. I advise you to learn php before doing anything there.

If you have completed the link collection, then the next step is to add content borders.

With these boundaries, the parser will determine the areas that need to be processed.

Syntax highlighting is implemented using the ACE javascript editor. All documentation and all available features are listed on the plugin page. The list is very impressive and I can’t bring it here, since this material is already very long. Just install the plugin and read on, you will be impressed, I guarantee it.

The page looks the same rss feed parser, with the only difference that there is no requirement to provide lists of links.

There are quite a few areas of activity where the parser can be used, but mainly it is collection various information. You can quickly collect pictures and links only programmatically. Using a parser to search for information allows you to automate this process, significantly saving time. If you have a wordpress site, then you can easily make it autocomplete using the AftParser parser.

is a free, universal parser for WordPress. allows you to collect content from one or different sources, processing it to the desired format on PHP language. The parser is made as a plugin for WordPress. After the usual installation of the plugin, the parser menu will appear in the wordpress console, as in the picture.

The parser has two main tools: the wordpress site parser and the wordpress rss parser.

Parser for WordPress can:

1. Parses data from links

You just need to provide links to the source. If there are no links, they can be collected by the parser. Specify the path to the sitemap and the grabber will collect all the links. Or you can collect links from any html pages. Links can be filtered according to the criteria you need. There are two link filters with which you can change the parsing conditions. Power Users can compose macros for parsing themselves, which makes the parser very flexible for their needs.

2. Parses RSS feed data

Everything is simple here, enter the desired feed URL and click start parsing.
One of the possible uses of a parser for wordpress is to fill the news columns on your site using information sources such as blogs, rss feeds, VKontakte pages, and so on. Competitors - WP-O-Matic, FeedWordPress, CyberSyn.

Sometimes it doesn’t make sense to spend time writing text for a WordPress site on your own. This case does not apply to blogs and infosites, since the income on them is formed precisely thanks to the posted posts.

A we are talking about online stores, company websites and news portals that are not designed for organic traffic. For such resources, unique materials are not as important as their constant updating.

To make an autocomplete site, you will need to set up a news parser for your project. First you need to find suitable sites from which you will parse. They must match the theme of your project, otherwise there is no point in duplicating information from them. If so, then you need to proceed to the second part of solving the parsing issue - this is how you will clone text from another site. The most primitive and inconvenient way is manual copying. But it is much more reasonable to use one of the plugins given in this article to activate a successful news and content parser.

WP-O-Matic

A very popular module for WordPress that will allow you to set up a functional news parser from other sites. The tool is installed in a simple way: either by uploading directly to a folder on the hosting, or through the “Plugins” tab.

Next, you have to configure the plugin if you want to provide content parsing. To do this, just click “Next” four times and at the end “Submit”. By doing so, you agree to the terms of use of this WordPress module. In particular, agree that only you are responsible for the theft of other people's materials, the content of other sites, etc.

If, in addition to the text component, you are also interested in pictures, then you need to create a directory called Cache in the folder with the plugin. Set special permissions for this folder. Next, you have to return to WordPress admin area. Go to the plugin settings and carefully check if there is a checkmark next to the Unix cron item. You need to check the Cache Image checkbox in the affirmative so that the news parser also copies pictures to your resource.

The WP-O-Matic module is good because it works on any page of the site. You can add a separate category if you want the list of news and content provided by the parser to be displayed there. To do this, first create the necessary rubric. Then, in the WordPress admin, in the WP-O-Matic tool settings, click Add Campaign. In the Categories line, check the box for the special category you created. And in the Feeds form write RSS feeds, which you are going to parse. You can specify several URLs for feeds at once, so that the text parser collects information from three or even four resources at once.

And another huge plus in the direction of the WP-O-Matic plugin is the automatic publication of the material. You don't have to log into the WordPress admin every hour to change the status of posts to "Published". The module will do it on its own. And if you wish, it can uniqueize the text through a special synonymization mechanism. This is the key difference this tool from its competitor, the FeedWordPress plugin.

Datacol

This is a functional grabber that is suitable not only for the WordPress engine. This is not just a text parser for website pages - it is a smart application that allows you to filter the copied material. For example, you can only post articles that have certain keywords. You can duplicate news directly from Yandex. Cloned materials will be exported in one of 15 available formats. The service will collect not only text, but also headlines, photos, publication date, links and other important data.

But Datacol is distributed on a paid basis. However, it is much cheaper than if you ordered material on the site through exchanges. The application costs less than 500 rubles and can be used for almost any engine. There is a demo version.

FDE Grabber

Another paid parser with a lot of features. This is already from the category of expensive grabbers, since it will cost about $ 90. But it will be possible to use it on 10 servers at once, that is, in theory, different webmasters can chip in for $ 9, thus reducing the cost of the purchase.

FDE Grabber is not exactly a WordPress plugin. Developers call their creation autonomous system, which works regardless of the type of CMS installed on the site. The main features of this parsing system:

  • downloading full-scale news or individual fragments;
  • you can schedule publications;
  • if you want to uniqueize duplicated material, there is a built-in synonymize function;
  • you can work through proxy servers;
  • parsing is able to bypass redirects, which can be a problem for other plugins;
  • you can automatically download all the content from the site and move it to your site (if it's not about news portals);

You can set up parsing completely at your own discretion, since the program has the ability to introduce microprograms to correct the work. For example, this way you can adjust the alignment and design of the copied material. You can also add the noindex and nofollow parameters for all links that will be in the text of the page. The parser even allows you to copy and automatically translate articles from foreign resources. This great way create a constant flow of content to your pages that will start to attract visitors over time!

Parser for WordPress is a Datacol setting designed to collect content (news, articles, reviews, etc.) with further export to the WordPress CMS.

In this example, the resulting content is exported to . The results for each post are stored in separate file, whose title is generated based on the title of the post, and whose content is based on the parsed information. You can also set up direct export to your WordPress blog. How to do this is shown in the video.

You can check the work of the parser for WordPress for free in the demo version of the program.
The main advantages of Datacol-based parser for WordPress are:

  • The ability to customize parsing for WordPress specifically for your needs (by you or ).
  • The ability to additionally process the collected data using plugins and also upload them to .
  • Possibility of cyclic launch of campaigns. When the results of the first parsing task will be input to the second data collection task. Read more.

How is the WordPress parser used?

If you have created a blog, then after some time it becomes clear to you that in order to promote it, you constantly need a new one. unique content. And it becomes too lazy to write articles on your own and fill the site manually. But where to get new content? Sooner or later, the time comes when you want to resort to autoblogging. Simply put, use a parser that will publish the information we need on its own. The content parser for WordPress will help us to cope with this task.

All sites have ever been created from scratch. But if the purpose of your site is to provide information, it will only become interesting when the amount of information exceeds a certain amount of. The WordPress blog parser is a great solution for this task. With its help, you will be able to catch up with competitors in a short time, significantly expanding the catalog of information on the site. The WordPress parser can be implemented as a WordPress grabber from a specific site.

The WordPress Blog Parser can help with many tasks, here are some of them:
– initial filling of the resource (a site parser for WordPress will allow you to fill the blog with information from scratch to the required volume in the shortest possible time);
– creation of an auto-filled blog (the WordPress parser will be able to provide regular auto-updating of the site content)
— publication of content “on schedule” (you can schedule the time to add posts to your site)

Bulk Posting in WordPress

The WordPress parser provided in Datacol is a prime example of using a bulk posting parser. It allows you to automatically receive content and publish it on your blog. The WordPress parsing process can be divided into several steps:

1) The process of collecting content. The parser for WordPress collects the necessary information for each blog post: title, content (uploaded with pictures that are saved to your computer for further upload via FTP to your server), category, author and link from which the data was collected (URL).

2) Saving the information collected by the parser for WordPress. After parsing, the collected information is stored TXT files(each post is saved in a separate text file), whose names are generated in accordance with the title of the post.

3) Export WordPress. It is also possible to directly export the parsed information to your WordPress blog. This makes the filling process very fast and eliminates the possibility of human error. The ability to export to WordPress is provided in the basic functionality of the program. You need to specify the settings for connecting to your blog in the program settings and set the data for export (title, content, category, etc.)

4) Information processing. If desired, the information collected during the WordPress parsing process can be processed (for example, subjected to automatic translation or synonymization). These features are implemented using plugins.

How to parse a site on WordPress?

You can not only publish the parsed information in WordPress, but you can also parse from it. The task of using other people's WordPress blogs as a source of content for your blogs arises quite often. To solve it, a WordPress site parser will help you. The algorithm of the WordPress site parser is similar to the one described above.

Benefits of a WordPress site scraper

You have probably already seen that the WordPress blog parser will help you not to spend a lot of time and effort manually filling your blog. Thanks to it, you can not only automate your work, but also increase its efficiency. You can download the parser for WordPress implemented as part of Datacol by

Testing the blog parser

To test the blog parser:

Step 2. The campaign tree contains the content-parsers/kolchaka-net.par campaign. Select it and click the Play button. Before running, you can edit the Input Data. So you can set a link to the blog or blog pages from which you will parse content.

Step 3. Wait for the results of the blog parser to appear. After the results appear, you can forcibly stop parsing (by clicking the Stop button).

click on image to enlarge

Step 4. After the end / forced stop of the parser in the My Documents folder, you can find text files(each post is saved in a separate file), whose titles are generated based on the titles of the posts:

click on image to enlarge

I present to you universal WordPress Grabber WP UniParser. This plugin is universal custom parser. When creating posts, the plugin can translate content through the service Google translate using any language pairs.

Customer Reviews

The topic on the search, where there were about 6-7 reviews, was deleted by the moderators (they say the product does not meet the rules of the forum). Nevertheless, one review can be read on the mulnet and armada forum. There are also reviews from bloggers: here and here. Recently, I stumbled upon this review by accident.

Main functionality

The WP UniParser plugin I created can do the following:
pull content from sites on any engines(the parser is configured using regular expressions and restriction strings, the setup is very simple, I will explain and show everything, in addition, there is a );
cut out scripts, comments, links, forms, pictures, spans, objects, as well as any fragments you specify from the content.
schedule publication posts;
put parsed materials into a category you define (or randomly distribute them into categories);
realize automatic translation(in either direction) in any languages ​​supported by Google Translate.

You can learn more about the set of plugin functions in the screenshot of its admin panel:

Also, for a complete understanding of the operation of a universal grabber, it is worth.