May 4, 2011

Extracted website incomplete at import | CyberSEO Pro | Support Forum

Extracted website incomplete at import
January 31, 2021
2:19 am
inTempoDK
Member
Members
Forum Posts: 6
Member Since:
January 10, 2021

Hi, I just tried to extract a page, and most of the data is missing.

This is what I see when I use the direct link to extract the data:

Login to see this link

This is what I got:

Login to see this link

As you can see, everything after the image is gone. Why is this? It happens often with different RSS sources (small things missing, everything missing, etc.). Is it not possible to just import all the data "raw" and then clean it up yourself?

Correct me if I'm wrong, but the Advanced -> PHP command is run on the post after the import has already happened, correct?

January 31, 2021
2:30 am
inTempoDK
Member
Members
Forum Posts: 6
Member Since:
January 10, 2021

Ah, I found out that it does import everything IF I remove all the Advanced -> PHP commands.

I had these:

$post['post_content'] = preg_replace('/<div><img src=".*?" class="ff-og-image-inserted">\s*<\/div>/s', '', $post['post_content']);
$post['post_content'] = preg_replace('/<strong>.*?\?\)/s', '', $post['post_content']);

According to a regex test site, these two should work fine, so I don't know why they would remove all the content after the image.

January 31, 2021
2:50 am
inTempoDK
Member
Members
Forum Posts: 6
Member Since:
January 10, 2021

Please just ignore this question. For some reason it didn't show up on the first try, but the preg_replace actually had a hit before the text, so it took the whole thing.

Sorry, my bad :/

While it's a nifty function, it's also dangerous :P
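For anyone who hits the same thing, here is a minimal sketch of how a pattern like the second one can swallow far more than intended. The sample content is made up for illustration and is not the actual feed item. The tighter pattern is only an assumed alternative, not something recommended by the plugin.

```php
<?php
// Hypothetical post content: an early <strong> tag, and the first "?)"
// only appears much later, in a different paragraph.
$content = '<div><img src="a.jpg" class="ff-og-image-inserted"></div>'
         . '<p><strong>Note:</strong> intro paragraph.</p>'
         . '<p>Body text that should survive.</p>'
         . '<p><strong>Read more (why?)</strong> closing line.</p>';

// Pattern from the post. ".*?" is lazy, but it still starts at the FIRST
// <strong> and runs until the first "?)" it can find, crossing tags and
// (because of the /s flag) newlines on the way.
$overreaching = preg_replace('/<strong>.*?\?\)/s', '', $content);

// Assumed tighter variant: [^<]* cannot cross into another tag, so the
// match has to begin and end inside the same <strong> element's text.
$bounded = preg_replace('/<strong>[^<]*\?\)/', '', $content);

echo $overreaching, "\n";
echo $bounded, "\n";
```

With the first pattern the body paragraph disappears along with everything between the first `<strong>` and the first `?)`. With the bounded one only the `<strong>Read more (why?)` part is removed. Both leave the closing `</strong>` behind, so a real rule would probably need to cover that as well.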

I do have one more question, though: Login to see this link doesn't seem to extract all the data on some articles (not this plugin's fault). Is there a solution for that?

One example is:

Login to see this link

vs

Login to see this link

Any solution for this?

February 1, 2021
1:35 pm
CyberSEO
Admin
Forum Posts: 3947
Member Since:
July 2, 2009

No, there is no solution for the Full-Text-RSS script. It's a third-party product and is not included in the CyberSEO Pro distribution. You can use it as a stand-alone service under the GNU General Public License.
